Innovating research in China


As the most populated country in the world, China
carries an immense disease burden. Thanks
to the rising life expectancy of its citizens, ur- 
banization, and lifestyle changes that have ac- 
companied rapid economic growth, the disease 
spectrum in China has expanded. Noncom- 
municable chronic conditions such as cardio- 
vascular diseases, diabetes, and cancer are now major 
contributors to ill health. Fortunately, an overhaul of 
the national health care system has been under way, 
with universal health care coverage now available for 
more than 95% of the population. Such a sea change is 
also needed in China’s biomedi- 
cal research environment. This 
requires transforming a system 
that has existed for decades. 

The good news is that China 
is not shying away from this 
challenge. 

Two national megaprojects 
were recently launched to ad- 
dress major bottlenecks: the de- 
velopment of innovative drugs 
and the control of major infec- 
tious diseases. The Major New 
Drug Innovation Program was 
funded with an ambitious goal 
of discovering 100 new drugs by 
2020. Although China has seen 
a reduction in the incidence of 
infectious diseases over the past 
10 years, hepatitis B and drug- 
resistant tuberculosis remain se- 
rious public health threats. The 
2003 outbreak of SARS in China
spurred the government to 
bolster early detection, disease 
research, and response systems. These improvements en- 
abled the nation’s rapid response and management of the 
avian influenza virus H7N9 in 2013. The current Ebola 
crisis is a reminder that newly emerging diseases are a 
major global threat. Indeed, China’s Control and Preven- 
tion of Major Infectious Diseases Program was launched 
with the goal of establishing a support system for poten- 
tial emerging diseases, in addition to controlling the in- 
cidence of major infectious diseases such as hepatitis B, 
tuberculosis, and HIV. 

Whereas these initiatives will affect China’s public 
health management, ensuring the implementation and 
success of other future initiatives will demand an innovative
system of medical research as a new driving force.


For many years, the Chinese Academy of Medical Sci- 
ences (CAMS) has played a pivotal role in spearheading 
biomedical research in China. As the country’s most com- 
prehensive and only national biomedical research orga- 
nization, CAMS is prepared to lead the national effort by 
introducing a new medical research system. With strong 
support from the National Health and Family Planning 
Commission of China (formerly the Ministry of Health), 
CAMS has established the National Cancer Center, the 
National Center for Cardiovascular Diseases (NCCD), and 
integrated innovation research centers focusing on Sys- 
tem Medicine, Neuroscience, Regenerative Medicine, and 
Pharmaceutical Sciences. The 
goal of these new institutions 
is to integrate basic and clini- 
cal research and to provide new 
funding mechanisms to support 
China’s best scientists and phy- 
sicians. For example, the NCCD 
has a state-of-the-art infrastruc- 
ture with the largest tissue bank- 
ing system in the world. It has 
recruited top talent from differ- 
ent countries as well as in China 
and has forged collaborations 
with the world’s leading univer- 
sities and institutions. And there 
is more on the way: Research 
centers focusing on metabolic 
disorders, aging, inflammation, 
and infectious diseases are in 
the planning stages. In addition 
to strengthening an emphasis on 
frontier medical sciences, these 
centers will be encouraged to 
coordinate national efforts for 
large-scale, long-term translational
and clinical research that focuses on major diseases,
something that cannot be accomplished by individual sci- 
entists or research groups. 

CAMS is also planning to explore new ways to evalu- 
ate, support, and lead the next generation of medical 
researchers in China. Based on the experiences of suc- 
cessful biomedical research organizations in Europe and 
the United States, CAMS will emphasize an investigator’s 
scientific merits and the strategic priority, novelty, and 
long-term objectives of research projects. 

Building this innovative research system will bring 
China into a new era of research and development. The 
hope is that this will fuel health science advances to ben- 
efit China and the world. - Xuetao Cao 




“...a sea change is also 
needed in China’s biomedical 
research environment.” 
president of the
president of
Global Alliance
for Chronic
Disease (GACD),
and president of
“At times, I thought he was me.”

Stephen Hawking, on Facebook, praising actor Eddie Redmayne, who 
played him in the new movie The Theory of Everything. 



New research whaling plan still divisive 



A catcher ship of the Japanese whaling fleet hunts minke whales in the Southern Ocean in 2006.


Japan’s widely anticipated new whaling research program for the
Antarctic, unveiled on 18 November, sets a target of capturing 
333 minke whales annually as part of a 12-year-long research ef- 
fort with the stated aim to “achieve conservation” of Antarctic 
marine ecosystem resources while also pursuing “sustainable 
utilization.” The New Scientific Whale Research Program in the 
Antarctic Ocean replaces JARPA II, a similar program halted by the 
International Court of Justice (ICJ) in The Hague last March. JARPA II 
called for taking 850 minkes, 50 humpbacks, and 50 fin whales annually, 
which the ICJ found was not scientifically justified. The new draft plan 
addresses the ICJ’s findings one by one and explains why collecting ac- 
curate data on the ages of minke whales requires killing them; it also in- 
cludes an appendix detailing why 333 whales are needed. Conservation 
organizations immediately denounced the new plan, suggesting that 
the research program remains a fig leaf to sidestep the International 
Whaling Commission’s (IWC’s) 1982 moratorium on commercial whal- 
ing. The draft plan, which is open to revision, will now be presented to 
IWC’s Scientific Committee. http://scim.ag/Japanwhaling


AROUND THE WORLD 

Big payoff for disease nonprofit 

BETHESDA, MARYLAND | The Cystic
Fibrosis Foundation announced last week 
that it made $3.3 billion by selling the roy- 
alty rights to treatments that its funding 
helped develop. The charitable organiza- 
tion invested $150 million in Boston-based 
Vertex Pharmaceuticals to create therapies 
for the fatal lung disease. The new deal 
with Royalty Pharma will “supercharge” 
the foundation’s efforts, president and CEO 
Robert Beall said in a 19 November press 
release, by funding more drug discovery 
research and helping care for cystic fibro- 
sis (CF) patients at centers accredited by 
the foundation. This increasingly popular 
profit model for foundations has received 
some criticism, however, for netting large 
royalties while patients pay for costly treat- 
ments. Vertex’s first CF drug to win U.S. 
approval costs $300,000 a year. 

UNAIDS’s push to end epidemic 

GENEVA, SWITZERLAND | No HIV vaccine 
exists, but advances in prevention and 
treatment have led to a growing conviction 
that the HIV/AIDS epidemic can be halted 
with existing tools. A report released 
18 November by the Joint United Nations 
Programme on HIV/AIDS (UNAIDS) 
emphasizes this point, yet cautions that 
the epidemic will continue to grow if 
treatment and prevention efforts remain 
at 2013 levels. The report, Fast-Track: end- 
ing the AIDS epidemic by 2030, notes that 
although antiretroviral treatments in
lower income countries have helped slow 
new infections and AIDS-related deaths 
from 2012 to 2013, more ambitious goals 
need to be set. Fast Track says the key is to 
increase three things to 90%: people who 
know their HIV status, receive treatment, 
and adhere to medication so that the virus 
in their blood drops below detectable 
levels. http://scim.ag/FastTrackAIDS

Science agency under threat 

BRUSSELS | Belgian researchers have
launched a petition to save the country’s
federal science policy office (BELSPO).
The new center-right government plans to
ax the funding and management agency,
arguing the move will make Belgian science
more efficient. Although a large part
of the nation’s science funds are managed
at the regional level, BELSPO is in charge
of 10 federal science institutes, funds basic
research projects that bring together scientists
from the country’s different language
communities, and manages Belgium’s contribution
to the European Space Agency,
among other tasks. In a petition launched
on 13 November, scientists urged the
government to reverse the decision, saying
it would push the country “below the
threshold of scientific ... poverty.” The plea
had some 8500 signatures as Science went
to press. http://scim.ag/BELSPO

Galapagos guardian may shut its doors

A 50-year-old conservation organization in Ecuador’s Galapagos
Islands is about to fall off a financial cliff. The Charles Darwin
Foundation has helped control invasive species such as goats
and blackberries and helped restore endangered species such
as giant tortoises and mangrove finches. Yet it’s been on shaky
financial ground after growing too fast a decade ago. A plan to sell
off land and build a gift shop seemed to help, but a few months
ago, a key sale fell through and local authorities shut down the
store. The foundation has launched an online fundraising campaign
(#savedarwin) and membership drive; last week, it approached
its largest supporters for a bailout. But as Science went to press,
no one had stepped forward to help. If the foundation should close
before the year’s end, as predicted, that would be “a disaster for
both fundamental research and conservation,” says board member
William Sutherland, a conservation biologist at the University of
Cambridge in the United Kingdom. http://scim.ag/Darwinshutdown

Kick-starting a moon mission

LONDON | A British team is asking
the public to donate £600,000 to send
an unmanned lander to the moon in
10 years, and so far, the public is coming
through. The crowdfunding campaign
for Lunar Mission One kicked off on
19 November; as Science went to press,
it had earned £293,354. Founded by David
Iron, a former Royal Navy engineering
officer and financial consultant specializing
in space projects, the project has earned
the support of space scientists and politicians
alike. The probe, which will land at
the moon’s south pole, will drill 20 to
100 meters into the lunar surface and bury
a time capsule in it. To sweeten the deal,
the team is offering funders a perk: the
chance to leave some remnant, from a
digital message to a strand of hair, in
the capsule.

BY THE NUMBERS

$2.6 billion

How much it takes to create a new
drug and bring it to market,
according to the Tufts Center for the
Study of Drug Development.

$186 million

A high estimate of new drug
development costs, according to
Doctors Without Borders,
which notes that the Tufts center
receives some funding from
pharmaceutical companies.

Immigration plan nods to science 

WASHINGTON, D.C. | President Barack
Obama announced changes to U.S. immigra- 
tion policy on 20 November, including two 
moves that affect the research community. 
One would expand the Optional Practical 
Training (OPT) program, which enables foreign
students studying science, technology,
engineering, and mathematics (STEM) at 
U.S. universities to work in the United States 
for up to 29 months. Obama also wants to 
make it easier for foreign entrepreneurs 
to enter the country. Both steps will “make 
it easier and faster for high-skilled immi- 
grants, graduates, and entrepreneurs to stay 
and contribute to our economy,” Obama 
said. http://scim.ag/immplan

Chinese undergrads flock to U.S. 

WASHINGTON, D.C. | Almost as many
Chinese students earn undergraduate 


degrees in the United States as do gradu- 
ate work, the result of surging interest in 
undergraduate studies and a flattening 
of demand for graduate training. Those 
new numbers— 110,550 versus 115,727— 
come from the Institute of International 
Education (IIE), whose annual Open Doors
report tracks the flow of international 
students into and from the United States. 
A report out last week from the Council 
of Graduate Schools highlighted another 
trend: U.S. graduate schools continue to be 
a huge draw for Indian students. First-year 
enrollment is up 27% this fall, on top of 
a 40% rise last year. But Indians remain 
largely indifferent to U.S. undergraduate 
degrees, IIE reports, composing only 3%
of the foreign-born undergraduate pool 
compared with China’s 30% share. 
http://scim.ag/forstudinUS 

Pledges of new protected areas 

SYDNEY, AUSTRALIA | The International 
Union for Conservation of Nature (IUCN)
concluded its once-a-decade World Parks 
Congress last week, trumpeting a list of 
more than 70 conservation commitments 
announced by countries around the world,
including a promise from Bangladesh to
create the country’s first marine protected 
area and pledges from Madagascar and 
Gabon to expand marine protected areas 
in their territorial waters. “There has 
been a willingness to move beyond words 
to action,” said IUCN Director General
Julia Marton-Lefevre during the closing 
ceremony. The commitments were seen 
as a step toward achieving new conserva- 
tion goals set out by the congress in a 
document dubbed The Promise of Sydney, 
which calls for protecting at least 17% of 
the world’s land and 10% of its oceans by 
2020. The 8-day congress attracted 6000 
participants from more than 170 countries. 
http://scim.ag/WorldParks 


NEWSMAKERS 

French chemist to head ITER 

ITER, the €13 billion international fusion 
reactor under construction in France, has 
chosen Bernard Bigot, chair of France’s 
Alternative Energies and Atomic Energy 
Commission (CEA), as its next director 
general. Bigot will replace Osamu Motojima 
and will begin his 5-year term next year. 

Bigot, 64, studied physics and chemistry; 
he has held senior positions in government, 
academia, and industry and was appointed 
as head of CEA in 2009. ITER involves 
seven international partners— China, the 
European Union, India, Japan, South 
Korea, Russia, and the United States— and 
is years behind schedule and hugely over 
budget. Motojima, the second ITER director 
general, was heavily criticized in an external 
management review completed earlier this 
year. http://scim.ag/BigotITER

Obama nominates fatigue expert 

Mark Rosekind, a psychologist who has 
demonstrated the benefits of brief naps 
and in-flight sleeping berths for pilots, 
has been tapped to head the National 
Highway Traffic Safety Administration. 
Rosekind led research into fatigue among 
airline pilots in the early 1990s at NASA’s 
Ames Research Center in Mountain 
View, California. The beds he studied are 
now standard on such flights. In 2010, 
he was named to the federal National 
Transportation Safety Board, where he 
has advocated lowering the drunk driving 
threshold from 0.08 blood alcohol content
to 0.05, something no state has done. The
Senate must confirm his nomination to
the highway safety agency, which has been
criticized for not doing more to correct
faulty car ignition switches and flawed air
bags. http://scim.ag/Rosekind



Restoring touch to reattached hands 


Roughly 85 people worldwide have undergone hand replant or transplant surgery,
in which surgeons reattach a patient’s hand or one from a donor by stitching 
together nerves and blood vessels. It takes about 2 years for nerves in the hand 
to regenerate after surgery— and studies suggest that even after the nerves have 
regrown, the surgically attached hand is less sensitive to touch than the original 
hand was. Last week, however, at the Society for Neuroscience's annual conference in 
Washington, D.C., scientists presented evidence that touch can return to near normal. 
Brain imaging data suggest that the brain's sensory map reorganizes after surgery, 
gradually adapting to new sensations from the reattached hand. Damage directly to 
the brain is trickier to repair, but people who have experienced spinal cord injury and 
stroke “may have more potential to recover than we ever thought,” says Scott Frey, 
a computational neuroscientist at the University of Missouri, Columbia, who led the 
work. http://scim.ag/transhandtouch
INFECTIOUS DISEASES

A new phase in the Ebola war 

As new cases drop in Liberia, experts call for a fresh 
strategy that would target flare-ups 


By Kai Kupferschmidt, in Monrovia 

When Kevin De Cock flew home
from this city of 1 million in August,
he was leaving behind an
apocalyptic scene. More than 
100 people were coming down 
with Ebola daily. Patients were 
dying outside of treatment units filled to ca- 
pacity, and bodies lay rotting in the streets. 
Some mathematical models projected that 
Liberia would face thousands of new cases 
weekly by December. “There was really no 
way of knowing how much worse this might 
get,” says De Cock, an epidemiologist at the 
U.S. Centers for Disease Control and Preven- 
tion (CDC) in Atlanta. 

But when he returned to Monrovia on 
9 November, the situation was very differ- 
ent. The grim projections had been wrong. 
Although the Ebola epidemic is still grow- 
ing in Sierra Leone, and Guinea’s numbers 
are swinging up and down, Liberia is now 
reporting only about 20 new patients a day. 
Treatment units have hundreds of empty 
beds, and Liberian President Ellen Johnson 
Sirleaf has lifted the state of emergency put 
in place in August. Now the country faces 
new challenges: rebuilding a shattered 
health care system, tamping down local 
outbreaks, and looking for ways to drive the 
number of new cases to zero. 

No one here is quite sure what has caused 
the epidemic to wane. Safe burials may be 
a big factor, says Katri Jalava, a Finnish 
veterinarian and an epidemiological con- 
sultant to the World Health Organization. 


It’s a local custom to wash the corpse and 
then use the same water to wash the hands 
of the bereaved, she says. “In terms of a 
disease like Ebola that is absolutely mad.” 
Most agree that people’s everyday behav- 
ior has changed as well. Ubiquitous street 
signs warn that “Ebola is real” and tell 
Monrovians “Don’t be the next case.” Out- 
side many homes are small hand-washing 
stations with bleach, and Liberians have 
stopped hugging and shaking hands. 

Yet “this is still a catastrophe,” De Cock 
says. Even 20 daily Ebola cases would have 
been unimaginable a year ago. And Guinea
has shown that success in fighting Ebola
can be short-lived: Twice, that country was
on the cusp of ending the outbreak, and
twice the virus came roaring back.

Past its peak? Confirmed and probable Ebola cases in Liberia, by week.

A gravedigger at the cemetery of the Ebola treatment
unit in Suakoko District in Bong County.

Some have even questioned whether 
Liberia’s recent drop in cases is real. At a 
meeting at the Liberian ministry of health 
last week, a U.S. Agency for International 
Development representative said he had 
been sent specifically to find out if the num- 
bers can be trusted. “Yes,” answered Swed- 
ish statistician Hans Rosling, who has spent 
the past month in Monrovia helping the 
Liberian government interpret epidemio- 
logical data. CDC researchers, for instance, 
have used mouth swabs to test dead bodies 
in Monrovia for Ebola; about 20% to 30% 
are now positive, down from close to 90% 
during the height of the epidemic. The real 
number of cases may be twice the reported 
number, but not much more, Rosling says. 
“We’re in a new phase now.” 

The international response has been 
slow to adapt. Although the Pentagon has 
said it will build fewer new Ebola treat- 
ment centers, their construction is ongo- 
ing. “That doesn’t make sense at all,” says 
Thierry Goffeau, emergency coordinator 
for Doctors Without Borders (MSF) in Li- 
beria. “It’s clearly a waste of human and fi- 
nancial resources.” Rosling, too, says tactics 
have to change. In September, the main job 
was building clinics, removing the dead, 
and keeping as many patients as possible 
isolated. Now, it’s about setting up a flex- 
ible system to respond to new outbreaks, 
identifying patients quickly, and tracing 
their contacts to prevent more infections. 
“What we needed to do in the first phase 
was rugby,” Rosling says. “Now it is chess.” 

Liberia’s medical system, which col- 
lapsed under the weight of Ebola, is gearing 
up again. Doctors are returning to work, 
and clinics are reopening. Goffeau says
that is sorely needed: “People are dying at
home of many other diseases than Ebola,
because they have no access to health care.”
But medical staff still face an important
risk. One in every hundred or thousand
patients may carry the Ebola virus, which
could start new cycles of infection. There
are reports that doctors at some clinics are
now doing surgery and delivering babies in
Ebola protection suits.

At Redemption Hospital in Monrovia,
whose inpatient department was closed this
summer after several doctors died from Ebola,
MSF is trying to protect staff with a new
triage unit, which opened on 19 November.
Patients with Ebola-like symptoms are interviewed;
if they meet the criteria for a suspect
case, they stay in one of 10 small rooms
while their blood is tested. Those who test
negative can enter the inpatient ward,
while an ambulance takes Ebola patients
to a treatment unit. MSF has also started
distributing malaria drugs to hundreds of
thousands of people, not just to lower the
burden of that disease, which was neglected
for months, but also to reduce the number
of people visiting hospitals with a fever.

Reopening Monrovia’s schools poses similar
quandaries. One idea is to screen pupils’
temperature as they enter the school.
“But what do you do if a 10-year-old kid
has a high temperature and the other kids
start pointing at him and shouting ‘Ebola’?”
Rosling says. In a meeting with President
Sirleaf, he has argued for a cautious approach:
Opening some schools and carefully
studying what happens.

The capital region still serves as a reservoir
from which patients travel to rural areas and
spark fresh outbreaks, De Cock says, and
now that the rainy season has ended, travel
may pick up. In Bong County, for instance,
a few hours northwest of Monrovia, two big
outbreaks are spreading, at least one seeded
from the capital. The treatment unit in the
district of Suakoko, run by the International
Medical Corps, is full, and new patients are
brought in daily. Sambhavi Cheemalapati,
the unit’s program coordinator, says she is
seeing far more patients than are accounted
for in the official numbers. Aid should focus
on spreading prevention messages in these
remote locales, Goffeau says. “If the people
really understand what Ebola is and how
to avoid infection, we might stop this epidemic,”
he says.

Such regional flare-ups make it unlikely
that the Liberian epidemic will be over
anytime soon, Rosling says. Still, he believes
it’s possible that the country may see
its first day without a single case as early
as December. Given the cataclysmic projections
of just 2 months ago, that would be a
remarkable turnaround. ■

Japan to assault asteroid

Hayabusa 2 will blast open carbon-rich celestial body

By Dennis Normile

Eighteen days after the European Space
Agency’s Philae lander touched down 
on a comet, another spacecraft will set 
out to attempt an even more intimate 
encounter with a small solar system 
body. On 30 November, Hayabusa 2 
will lift off from Japan’s Tanegashima Space 
Center on a 6-year round trip to asteroid 
1999 JU3. If all goes as planned, it should
bring home samples not just from the as- 
teroid’s surface but also from beneath it, by 
blasting a crater and collecting ejecta. 

“By comparing samples from the surface 
and from underground, we’ll get a better 
idea of what kind of material makes up as- 
teroids and how it has changed over time,” 
says Sei-ichiro Watanabe, project scientist 
and a planetary scientist at Nagoya Univer- 
sity. Observations of the impact will also pro- 
vide clues about the kilometer-wide body’s 
structural properties, says Erik Asphaug, a 
planetary scientist at Arizona State Univer- 
sity, Tempe, who is not involved with the 
mission. That could help scientists foretell 
the destruction in store should an asteroid 
ever hit Earth. 

A decade ago, the original Hayabusa was 
the first probe to sample an asteroid. It 
snagged a few grains from asteroid Itokawa, 
an S-type, or stony, asteroid thought to have 
formed in the inner regions of the asteroid 
belt, where water and organic material have 
burned off. Overcoming fuel leaks and nu- 
merous equipment failures, Hayabusa deliv- 
ered the samples to Earth in June 2010. 

Its successor’s target is a carbonaceous, or 
C-type, asteroid from the outer regions of the 
asteroid belt, potentially bearing water and 


organic elements. Watanabe says that com- 
bining findings from the two asteroids and 
the comet will help scientists sharpen their 
picture of the early solar system and which 
of its varied bodies might have brought wa- 
ter and organic material— the seeds of life— 
to Earth. 

The roughly $245 million Hayabusa 2 
uses thrusters and stabilizers similar to but 
more robust than those of its predecessor, 
says Hitoshi Kuninaka, project manager at 
the Japan Aerospace Exploration Agency’s
Institute of Space and Astronautical Sci- 
ence in Sagamihara. Hayabusa 2’s instru- 
ments are also largely updated versions of 
its predecessor’s, but “they are tuned for a C- 
type asteroid,” which has a very dark surface, 
Kuninaka says. The first Hayabusa carried a 
single small lander, which missed Itokawa 
and drifted off into space. Hayabusa 2 will 
release four landers carrying a variety of 
cameras and instruments. One, the Mobile 
Asteroid Surface Scout, or MASCOT, contrib- 
uted by the German Aerospace Center and 
the French government space agency, will 
explore the surface by hopping across it. 

Hayabusa 2’s most audacious gambit will 
be blasting that crater. The spacecraft will 
touch down to capture surface samples. Then 
it will release an explosive-laden impactor 
and duck behind the asteroid for safety. Det- 
onating the explosives will drive a projectile 
into the asteroid at 2000 meters per second,
digging a crater anywhere from 2 to 7 meters
in diameter. “We really don’t know what the
asteroid is made up of,” Watanabe says.

A floating camera will record the show.
Once the dust settles, Hayabusa 2 will touch
down near the crater, retrieve ejecta, and
then head home. ■
China supersizes its underground physics lab

Planned expansion could pave way for “ultimate dark matter experiment” 


By Dennis Normile 

The world’s deepest physics laboratory
is about to become one of its largest.
Early next year, workers will start 
carving four cavernous experiment 
halls along a tunnel through Jinping 
Mountain in China’s Sichuan prov- 
ince. Once the science at the China Jinping 
Underground Laboratory (CJPL) is scaled up 
as well, “it will be a milestone for Chinese 
physics,” says Nigel Smith, director of the 
underground SNOLAB in Sudbury, Canada. 

Opened in December 2010, CJPL is the 
deepest facility of its kind, with 2400 meters
of rock shielding it from background radia- 
tion (see chart). The lab so far has focused 
on the hunt for dark matter, the universe’s 
postulated missing mass. More space will 
allow larger and more sensitive dark matter 
detectors and an expanded research agenda 
that will include a nuclear astrophysics ac- 
celerator to replicate the inner workings of 
stars. CJPL also hopes to branch out into ob- 
serving neutrinos and studying exotic par- 
ticle phenomena. 

Deep underground labs elsewhere have a 
head start in all of these areas. This means 
the Chinese will have to choose research 
targets carefully based on “if and where 
they can do better” than existing experi- 
ments, says Alessandro Bettini, director 
of the Canfranc Underground Laboratory 
in Spain. Others have confidence in the 
Chinese quickly coming up to speed. “It’s 
a highly competitive site [with] lots of po- 
tential,” says John Ellis, a theorist at King’s 
College London who chairs a new interna- 
tional advisory committee that visited the 
lab last month. 

China’s ascent in underground physics 
began serendipitously in August 2008, when 
Qian Yue, a physicist at Tsinghua University 
in Beijing, saw a TV report about access tun- 
nels being bored through Jinping Mountain 
for a massive hydroelectric project. Tsinghua 
approached the Yalong River Hydropower 
Development Co. Ltd., which agreed to excavate
two experiment halls totaling 4000 cubic
meters along one of the tunnels (Science,
5 June 2009, p. 1246). 

CJPL, run by Tsinghua, now hosts two dark 
matter experiments. The Particle and Astro- 
physical Xenon (PandaX) experiment uses a 
37-kilogram liquid xenon target to watch for 
dark matter in the form of postulated weakly 
interacting massive particles (WIMPs). If 


WIMPs exist, they should occasionally travel 
unmolested through the mountain and col- 
lide with a xenon nucleus, producing a flash 
of light. In the other experimental hall, the 
China Dark Matter Experiment (CDEX) aims 
to catch the electrical signal produced if a 
WIMP bumps into a nucleus within a germa- 
nium crystal. “There is complementarity” be- 
tween the two approaches, says Henry Wong, 
a physicist at Academia Sinica’s Institute of 
Physics in Taipei and member of the CDEX 
collaboration. Xenon detectors should be 
better at distinguishing a WIMP signal from 
flashes sparked by some kinds of background 
radiation, whereas the more sensitive germa- 
nium detectors ought to be able to spot inter- 
actions involving lighter WIMPs. Although 
neither experiment has yet detected a WIMP, 
they both have helped confirm results from
other labs indicating that WIMPs are likely
to have very little mass.

Deep, dark labs

In the hunt for dark matter, deeper is better.
Labs are built in mines (light blue) and tunnels
(dark blue and red).

For an initial effort, the results are “pretty 
decent,” says Wick Haxton, a theorist at the 
University of California, Berkeley. To boost its 
chances of sighting WIMPs and determining 
their mass, CJPL needs a larger volume of 
xenon, more germanium crystals, and better 
shielding. All of that requires more space. “If 
they significantly enlarge those experiments 
over the next couple years,” Haxton says, 
“they could end up being very competitive.” 

CJPL is about to get the elbow room it 
needs. Before the hydropower construction 
work wraps up next year, crews will bore 
another four 130-meter-long experiment 
halls. When lined with concrete, each will be 
13.2 meters wide with an arched ceiling 
13.2 meters high. All told, the enlarged facil- 
ity will have 120,000 cubic meters of research 
space, second only to Italy’s Gran Sasso Na- 
tional Laboratory, which has 180,000 cubic 
meters. By piggybacking on the hydropower 
project, Tsinghua limited the expansion’s 
cost to $50 million. Yue hopes to start experi- 
ments in the new halls by the end of 2016. 

CDEX aims to boost its sensitivity by in- 
creasing its germanium target from 1 to 
10 kilograms and by adding more elaborate 
shielding. The PandaX team is thinking 
much bigger. “We are interested in building 
the ultimate dark matter experiment: a 
20-ton scale liquid xenon experiment,” says 
Xiangdong Ji, a physicist at Shanghai Jiao 
Tong University and the University of Mary- 
land, College Park. That would be several 
times larger than existing liquid xenon ex- 
periments. Realizing such a mammoth proj- 
ect, says Ji, who leads PandaX, could require 
teams worldwide to pool resources. 

A new experiment planned for the ex- 
panded space is the Jinping Underground 
laboratory for Nuclear Astrophysics (JUNA). 
Its pièce de résistance would be a particle 
accelerator used to replicate the nuclear pro- 
cesses generating energy within stars and the 
synthesis of heavier elements from hydrogen 
and helium in the primordial universe. The 
rock shielding would reduce background 
noise, making it easier for researchers to 
detect rare and subtle signals. With a more 
powerful accelerator and a deeper location 
than other efforts, says project head Weiping 
Liu, a physicist at the China Institute of 
Atomic Energy in Beijing, “JUNA has the 
potential to take a favorable position among 
underground nuclear astrophysics labs.” ■ 
CLIMATE SCIENCE 

Chasing South America’s monsoon 

Future of key rain pattern in 
doubt as climate negotiators 
gather in Peru 

By Lizzie Wade 

When delegates from nearly 200 nations 
gather in Lima next week 
to begin laying the groundwork 
for a new international accord to 
curb global warming, few may re- 
alize that they’ve stepped into the 
middle of a climate mystery. In the past few 
decades, researchers have recognized that 
tropical South America’s summer rainy sea- 
son, which fills parched rivers and brings 
new snow to rapidly shrinking mountain 
glaciers, is part of a larger monsoon system. 
They are now starting to unravel the com- 
plex dynamics of the system, which helps 
supply water to some 350 million people— 
and struggling to forecast how climate 
change may transform it. 

“There [are] millions of people who rely 
on that water, and we take it for granted,” 
says climate scientist Mathias Vuille of the 
University at Albany, State University of New 
York. “But it’s a very, very sensitive system, 
and we have to be aware of that.” He and 
other researchers have found hints that the 
monsoon pattern may already be changing. 
Now, they are turning to ancient lake sedi- 
ments to reconstruct the monsoon’s past be- 
havior, and using simulations to see how the 
wet and dry seasons might shift in the fu- 
ture if nations can’t agree on ways to reduce 
greenhouse gas emissions. 

The South American monsoon is “not 
your typical monsoon,” says Broxton Bird, a 
paleoclimatologist at the Indiana University- 
Purdue University Indianapolis School of 
Science. Traditionally, scholars defined a 
monsoon as a seasonal reversal in wind pat- 
terns that creates wet and dry seasons. That’s 
exactly what happens over India, where the 
English word first came into common usage. 
There, it is “the Monsoon” with “a capital M,” 
says Paul Baker, a geologist at Duke Univer- 
sity in Durham, North Carolina. 

Over time, however, researchers “started 
talking more about monsoon with a little m,” 
Baker says, using the term to refer to strong 



seasonal changes in precipitation, not nec- 
essarily wind. In the late 1990s, researchers 
realized that a monsoon in this sense affects 
a large swath of South America, stretching 
from western Peru and northern Argentina 
across Amazonia to Brazil’s east coast (see 
map). Each region’s rainy season is not an 
independent, local phenomenon, but rather 
part of a climate web that reaches far beyond 
the continent’s borders. 

Understanding what drives the mon- 
soon and why it varies from year to year is 
proving tricky. One obstacle is a lack of his- 
torical data. Researchers studying India’s 
monsoon have benefited from “rain gauges 
... that go back more than 100 years,” says 
Charles Jones, a climate modeler at the 
University of California, Santa Barbara. 


Rainmaker 


Between 1979 and 2004, the South American 
monsoon provided more than half of the 
annual precipitation in some areas. 



“We don’t have [that] over the Amazon.” 

To fill the gap, some researchers are pick- 
ing through sediment cores taken from 
lake bottoms, which hold clues to past cli- 
mates. Baker, for instance, has reconstructed 
25,000 years of South America’s climate his- 
tory by studying the remains of tiny aquatic 
plants, called phytoplankton, trapped in 
sediments from Lake Titicaca on the border 
of Peru and Bolivia. Because rainfall feeds 
the lake, it becomes shallower during weak 
monsoon years, causing plankton species 
that thrive under those conditions to boom. 
More shallow-water plankton in a sediment 
layer means a weaker monsoon at that time. 

The sediments reveal a pattern: Over 
thousands of years, the South American 
monsoon was particularly strong when sea 
surface temperatures in the North Atlantic 
were relatively cool. But when the North 
Atlantic warmed, the monsoon rains weak- 
ened, the researchers reported in a 2001 pa- 
per in Science. 

More recent studies have confirmed the 
link, and scientists believe they understand 
the mechanism. When the North Atlantic 
cools, Bird explains, the atmosphere’s 
Intertropical Convergence Zone (ITCZ), an 
equatorial band typified by heavy clouds and 
precipitation, moves south. That leads to 
stronger summer monsoon rains. When the 
North Atlantic warms, however, the ITCZ 
and its moisture drift north, drying out the 
monsoon belt. Two recent record-breaking 
droughts in Amazonia, in 2005 and 2010, 
were associated with particularly warm peri- 
ods in the North Atlantic, Bird notes. 

Paleoclimate studies have also helped re- 
veal that the monsoon has weakened in the 
past century. “The monsoon as we know it 
today is not necessarily representative of 
the conditions over the last 1000 years,” 
Vuille says. “It’s rather on the weak end of 
the spectrum.” 

What future global warming may do to 
the South American monsoon is unclear. 
Sediments from Lake Titicaca have proved key to reconstructing the South American monsoon’s past. 


When researchers use computer models to 
create a world with twice as much atmo- 
spheric carbon dioxide as today and tem- 
peratures several degrees higher, they get 
mixed results. “Half of the models say ... it’s 
going to get wetter in the monsoon area, 
and the other half say it’s going to be drier,” 
Baker says. “Half the models are right, but 
we don’t know which half.” 

The models do, however, tend to sug- 
gest the winter dry season could get longer. 
That may already be happening: The dry 
season is now “about 3 weeks to 1 month 
longer” than in recent decades, with the 
rains beginning in November instead of 
October, notes José Marengo, a senior scientist 
at the Brazilian National Institute 
for Space Research in São Paulo. Such extended 
dry spells could spell trouble for 
farmers and hydropower dams. (Brazil gets 
80% of its electricity from dams, Marengo 
notes.) If the trend continues, the historic 
2005 drought could become “the new 
norm” in southern Amazonia by the late 
21st century, a team of climate researchers 
concluded in a 2013 paper in the Proceed- 
ings of the National Academy of Sciences. 

Some models also hint that wet seasons 
could become shorter but more intense, 
says Hervé Douville, a climate modeler at 
the National Centre for Meteorological Re- 
search in Toulouse, France. That’s worrying 
because extreme rainfall can lead to flash 
floods and landslides, a particular concern 
because the monsoon belt includes hilly, 
densely populated cities. 

Delegates heading to the Lima climate 
negotiations hope they can reduce such 
climate threats. The goal is to advance a 
follow-on pact to the 2005 Kyoto Protocol, 
which would be finalized in December 2015 
at a meeting in Paris and then enter into 
force in 2020. Although South America’s 
monsoon isn’t likely to be a major topic in 
Lima, the meeting’s outcome could loom 
large in its future. ■ 


MEDICAL RESEARCH 

U.S. to expand public access 
to clinical study results 

Proposals would require sponsors to report data 
on thousands of additional trials each year 


By Jocelyn Kaiser 

The amount of clinical data that drug companies 
must share with the public would vastly expand under new 
lic would vastly expand under new 
rules proposed last week. A proposal 
from the U.S. Department of Health 
and Human Services (HHS) would 
require trial sponsors to report summary 
results for drugs and devices that are never 
approved, not just for those that reach the 
market. And a draft policy from the Na- 
tional Institutes of Health (NIH) would ex- 
pand the requirement— which now applies 
only to trials regulated by 
the U.S. Food and Drug 
Administration (FDA)— 
to all trials funded by the 
health agency. 

Sharing these results 
should not only be use- 
ful for researchers, but 
also “helps fulfill society’s 
ethical responsibility” to 
people who volunteer for 
trials, said NIH Director 
Francis Collins during a 
press teleconference. “We 
owe to our patients, to 
our participants in these 
trials, the explanation of 
what happened.” 

The results would be 
posted on ClinicalTrials.gov, a public data- 
base launched in 2000 that now contains 
registration data for more than 178,000 tri- 
als regulated by FDA. Under existing rules, 
drug companies must also submit summary 
results that include information such as the 
number of participants, their age and gen- 
der, outcomes, and adverse events. These 
results have been posted for more than 
15,000 trials. 

But the summary results requirement 
applies only to drugs and devices ap- 
proved by FDA. Under the HHS proposal, 
companies will also need to report results 
for unapproved products— although only 
for late-stage trials, not early safety trials, 
known as phase I. The new requirement to- 
gether with the NIH proposal, which would 
require reports from roughly 650 trials a 
year, should add another 100 to 150 reports 


to the 100 that the database now receives 
each week, said Deborah Zarin, director of 
ClinicalTrials.gov. 

NIH officials cite a 2014 analysis of 
400 clinical trials that found that 4 years af- 
ter completion, 30% of the studies had not 
shared results in a journal or 
ClinicalTrials.gov. Under both the NIH and HHS draft 
rules, which won’t take effect until after the 
final policies are published, results must be 
reported within a year after the trial ends. 
Noncompliance could be punished by with- 
holding funding for NIH grantees or impos- 
ing fines on companies regulated by FDA, 
said Kathy Hudson, NIH’s 
deputy director for sci- 
ence, outreach, and policy. 

Peter Doshi of the 
University of Maryland 
School of Pharmacy in 
Baltimore, an advocate 
of clinical data shar- 
ing, welcomes the new 
requirements. Hidden 
trial results are “not 
good from an evidence 
perspective and it’s em- 
barrassing from a policy 
perspective,” he says. Re- 
quiring that trial results 
for unapproved drugs be 
shared is also a positive 
step, Doshi says. “You’re 
reducing the chance that somebody will 
redo experiments that were already done 
and put people in harm’s way because the 
research wasn’t shared.” But Doshi says 
the U.S. plans compare unfavorably with 
Europe’s, where the European Medicines 
Agency plans to make detailed clinical data 
reports publicly available for approved 
drugs and to provide data on individual 
patients to researchers. 

Doshi is an associate editor of The BMJ, 
which is part of a group called AllTrials that 
is pushing for release of detailed clinical 
trial data. “The white elephant in the room 
is that FDA sits on more data, across more 
drugs, across more therapeutic indications 
than anybody else on the planet,” he says. 
“And their attitude is, ‘Great idea [to make 
the data public], let somebody else take 
care of this.’ ” ■ 


Sharing clinical trial 
results “helps fulfill 
society’s ethical 
responsibility. ... We 
owe to our patients, 
to our participants 
in these trials, the 
explanation of what 
happened.” 

Francis Collins, National 
Institutes of Health 
By Eric Hand 

Jay Piatek pulls his Jaguar convertible 
onto the highway frontage road 
and turns past the billboard that 
bears his name and a slogan: Weight 
loss. Health. Motivation. It is here, 
at the Piatek Institute, on the bland, 
exurban edge of Indianapolis, that 
the doctor made his fortune, minis- 
tering to thousands of patients who 
struggle with eating compulsions. 

Piatek has his own compulsions. Locked 
away at the clinic is a climate-controlled 
vault for his abiding passion: meteorites. 
With a near-photographic memory, he is 
able to rattle off the names and provenance 
of some of his more exotic specimens— 
Itzawisis, a pallasite found in Namibia; 
Gujba, a bencubbinite found in Nigeria. “They’re 
like little movie stars,” he says dreamily. 

There is one diva in particular that I’m 
here to pay homage to: Black Beauty, a shiny, 
scaly-skinned, 4.4-billion-year-old rock from 
Mars. It began its journey to Earth more 
than 5 million years ago, about the time 
humans and chimpanzees were splitting 
from a common ancestor. That is when an 
asteroid struck Mars, catapulting the rock 
into space. Sometime in the last thousand 
years or so, orbital mechanics and grav- 
ity delivered the wandering rock to Earth. 
Surviving an incendiary plunge through the 
atmosphere, it landed in more than a dozen 
pieces in the western Sahara. There the frag- 
ments sat, untouched except by wind and 
sand. Finally, a nomad plucked a piece from 
the dunes. After passing through the hands 
of several Moroccan middlemen, the first 
piece wound up in Piatek’s hands in 2011. 
He would acquire nine more. 

Black Beauty has since set the collecting 
world on fire, reaching values of more than 
$10,000 per gram. (Gold trades for $40 per 
gram.) The price is in no small part due to 
the parade of scientific discoveries emerg- 
ing from the rock’s jumbled-up guts. It is 
the oldest rock from Mars and chock-full of 
the planet’s primordial water. Most intrigu- 
ing of all, it appears to be the first martian 
meteorite made of sediment, deposited by 
wind or water. That makes Black Beauty not 
only a cosmic blessing— sedimentary rocks 
are fragile and thought unlikely to survive 
interplanetary launches— but also a boon 
for astrobiologists. “If you’re going to look 
for life, you want a sedimentary rock,” says 
Munir Humayun, a meteoriticist at Florida 
State University in Tallahassee who led a 



Black Beauty, a meteorite from 
ancient Mars, has captivated collectors 
and scientists 


Jay Piatek, a doctor and meteorite collector 
from Indianapolis, controls two-thirds of the 
2 kilograms of Black Beauty known to exist. 


1044 28 NOVEMBER 2014 • VOL 346 ISSUE 6213 


Corrected 26 November, 2014; see full text. 

Published by AAAS 


sciencemag.org SCIENCE 








NEWS | FEATURES 



In 2012 and 2013, hundreds of meteorite hunters combed the sands of the western Sahara for Black Beauty— and found pieces like this 241-gram specimen. 


study that last year pinpointed the rock’s age. 

Even if it holds no trace of life, Black 
Beauty has plenty to captivate scientists. It 
is a breccia— a rock made of rocks, welded 
together in a fine-grained matrix. Each em- 
bedded pebble has a history to be unraveled. 
Black Beauty holds not just a geological story 
but an immense anthology. “We’re looking at 
the equivalent of a martian geological field 
area,” says Carl Agee, a meteoriticist at the 
University of New Mexico, Albuquerque. 

Agee led the team that, in a 2013 publica- 
tion in Science, was the first to recognize that 
Black Beauty was from Mars— and yet was 
unlike any of the other 74 known martian 
meteorites. As expensive as Black Beauty has 
become, it is still a bargain compared with 
proposals for a robotic Mars sample return 
mission. “I sort of half-jokingly say ‘Morocco 
sample return,’ ” Agee says. “Some of these 
meteorites, like Black Beauty, are the next 
best thing.” 

About a dozen institutions have paid 
dearly— or traded choice holdings— for the 
privilege of working with tiny slices. Many 
more scientists are in the queue for pieces. 
And Piatek, through a combination of luck, 
money, and quick action, has cornered the 
market. He controls about two-thirds of the 
2 kilograms of Black Beauty known to exist. 
The pieces are waiting for us at the clinic. 

THE SUN HAS STARTED TO FALL on this 
steamy July day, but Piatek is just revving 
up as he parks. Entering the clinic, he shows 
me a room-sized tank stocked with tropical 
triggerfish, a tank so big that Piatek puts on 
a wetsuit and goes diving to clean it. There 
are empty patient waiting rooms, decked 
out in a style you might call Greco-Vegas: 
plastic fruit, purple drapes, faux-Doric columns. 
A few hours before, they would have 
been filled with people waiting for 5-min- 
ute visits with Piatek and his prescriptions 
of hormones, vitamins, and stimulants. His 
fellow doctors may look askance, but Piatek 
doesn’t really care. “They think we’re just 
giving out speed,” he says. “I’d rather have 
my patients love me. Screw my colleagues.” 

Normally, Piatek keeps Black Beauty and 
his other most valuable specimens at a bank 


vault. But he has just recently returned from 
the Smithsonian Institution National Mu- 
seum of Natural History in Washington, D.C., 
where he attempted to make a trade, and 
so he hasn’t yet returned the stones to the 
bank. They lie in a carry-on roller bag inside 
the clinic’s vault. Piatek punches in numbers 
and tugs on the door. “Unbelievable,” he says. 
“Look at that.” The door handle is jammed. 

He fetches some tools and pops out the 
hinges, but the door is still wedged. Piatek 
calls his niece to summon more muscle: 
her husband, Brock. “Tell him to bring a 
couple screwdrivers, and a few pliers, but 
not little ones.” Then he calls another num- 
ber. “It’s me, Piatek. I’m coming in with a 
friend.” He hangs up and faces me. “Do you 
like steak?” he asks. 

RUTH’S CHRIS STEAK HOUSE is Piatek’s 
customary retreat after watching Indiana 
Pacers basketball games from his courtside 
seats. There, over steaks and California 
cabernet, Piatek begins the saga of a 4.4- 
billion-year-old rock and the 53-year-old 
man who owns it. 

Piatek was born in Gary, Indiana, to a 
mother who worked in the steel mills. His 
adoptive father was a driver for the fire department. 
He spent college summers at the 
mill, working in the coke ovens. “It was, like, 
brutal,” he recalls. “You’re blowing your nose 
for 2 weeks afterwards and coke is coming 
out, and it’s not the coke that’s good.” Medical 
school at Indiana University opened the way 
to a different life. 

After marriage, four children, and nearly 
a decade as an emergency room doctor, 
Piatek started his weight loss practice in 
1995. Business was brisk. At one point, he 
was seeing 150 patients a day. But in 2003 his 
son was working on a fourth-grade school 
project on meteorites. “What’s a meteorite?” 
Piatek asked him— and an obsession was 
born. “I loved that you could get a piece of 
outer space that’s billions of years old, and 
you can hold it in your hand,” he says. 

There are more than 50,000 named and 
classified meteorites. The vast majority are 
chondrites, pieces of asteroids filled with 
little glassy beads called chondrules. Much 
rarer, and more expensive, are meteorites 
from the moon, Mars, and other special bodies 
in the solar system, such as the asteroid 
Vesta. At first, Piatek was disappointed to 
learn that he couldn’t own a piece of everything: 
Some were locked up in institutional 
collections, others destroyed or lost. He 
made a list of priorities and started hunting 
on eBay, quickly graduating to gem and 
mineralogy shows. At his peak, he had 1300 
specimens. By comparison, Arizona State 
University, Tempe, which has the largest 
“It was the most expensive 
stone of my life. ... I have 
never been so stressed.” 

Aziz Habibi, meteorite dealer 


university-owned collection in the world, has 
about 2000, says Meenakshi Wadhwa, the di- 
rector of Arizona State’s Center for Meteorite 
Studies, who recently traded with Piatek for 
a 20-gram cut of Black Beauty. 

A few years ago, spurred by chest pains 
and intimations of his mortality, Piatek de- 
cided to cut back on both his working hours 
and his meteorite addiction. He brought on 
another doctor to do most of the work at the 
clinic. And he sold 400 of his specimens— 
including a couple of Black Beauty fragments 
—for more than a million dollars to Naveen 
Jain, a tech billionaire. Although the trans- 
action delighted his wife, who never quite 
understood the value of collecting these 
rocks, Piatek winces at the memory. “I had 
stuff that no one had,” he says. “I had the 
one that hit the lady, I had the one that hit 
the dog.” 

Piatek has whittled his collection down 
to a prized 400 or so. But his passion still 
burns. One paramount goal: completing his 
collection of pallasites, pretty meteorites 
that often contain green olivine crystals sus- 
pended in a gray matrix like luminous polka 
dots. Of the 59 pallasites that are possible to 
get (a few others reside in untouchable col- 
lections), Piatek has 57. 

One, the El Rancho Grande pallasite, came 
in a trade from Agee, a genteel scholar who 
was once chief scientist for NASA’s Johnson 
Space Center in Houston, Texas, and now cu- 
rates the University of New Mexico’s meteor- 
ite collection. “Our relationship was sealed at 
that point,” Piatek says. He has donated more 
of Black Beauty to the University of New 
Mexico than anywhere else, elevating its 
collection from obscurity. 

The relationship would prove crucial 
when it came to figuring out the rock’s ori- 
gin. In 2011, Piatek bought the first piece, a 
320-gram mass, from Aziz Habibi, a dealer 
from Erfoud, Morocco, who had been regu- 
larly sending Piatek pictures of meteorites 
for sale— including one he called Black 
Beauty. Almost on a whim, Piatek paid 
$6000 for the stone— cheap in the mete- 
orite world— thinking it was nothing more 
than a regular chondrite. (Piatek says he 
later paid a bonus to Habibi when its true 
nature was revealed.) 

Piatek sent the stone to Agee, who wasn’t 
convinced that it was a meteorite at all. It 
didn’t have the heft of a chondrite, which 
are typically rich in dense metals. And the 
scaly skin— the “fusion crust” that forms on 
the superheated surface of a falling meteorite 
—seemed so shiny that it might be fake. “I 
thought someone had taken a desert stone 
and spray-painted it,” Agee says. Non- 
plussed, he stuck the rock on a shelf for a 
few months. Eventually, in the fall of 2011, 
he took a diamond-tipped rock saw, sliced 
off one end of the stone, and marveled at 
what he saw inside. Dark, angular crystals 
of pyroxene floated alongside white, chunky 
feldspars. Large, faint pebbles sat next to 
tiny, dark beads. It was evocative of the lu- 
nar breccias Agee recalled from the Apollo 
days— except that Black Beauty’s spherules 
were much more diverse. 

Agee now knew he had a meteorite, but 
what was it? He chipped off a gram piece 
and put it under an electron microprobe, 
which uses an electron beam to excite at- 
oms in the rock’s minerals. The atoms then 
emit x-rays that reveal the sample’s chemical 
makeup. It turned out that the rock had an 
elevated manganese-to-iron ratio— higher 
than that in Earth rocks and consistent with 
other martian meteorites. Next, Agee and 
his colleagues used a laser to extract water 
molecules trapped within minerals in the 
meteorite and fed them into a mass spec- 
trometer to calculate the ratio of deuterium, 
a heavy isotope of hydrogen, to ordinary 
hydrogen. Every place in the solar system has a 
distinctive ratio. Lo and behold, the copious 
water in Black Beauty was Mars-like. 

Agee was convinced that Black Beauty 
was from Mars— and also that it was unlike 
all the other martian meteorites. In a talk 
at the Lunar and Planetary Science Confer- 
ence in Texas in March 2012, he and his team 
described the rock as a volcanic breccia— a 
melange of mineral crystals that formed 


A winding journey 

The martian meteorite called Black Beauty 
is the oldest in existence. 

4.4 billion years ago 

The earliest pieces of Black Beauty form 
in a subterranean magma chamber. 

5 million years ago 

An asteroid impact strikes Mars, 
launching Black Beauty from the surface 
and toward Earth. 

<1000 years ago 

Black Beauty falls in the dunes of the 
Sahara desert in more than a dozen pieces. 

2011 

Jay Piatek, a doctor from Indiana, 
acquires a first 320-gram piece. 

2012 

Researchers identify Black Beauty as a 
2.1-billion-year-old volcanic rock from Mars. 

2013 

Martian origin confirmed and published; 
samples reinterpreted as sedimentary, 
4.4 billion years old. 
underground in different slow-cooling 
magma chambers and were blasted together 
and fused by an explosive eruption. 

After Agee’s team published their find- 
ings in Science in February 2013, Piatek’s life 
instantly became more complicated. “You 
guys killed me,” he says. Almost overnight, 
Moroccan dealers raised their asking prices 
for chunks of Black Beauty from tens to hun- 
dreds to thousands of dollars per gram. And 
while Piatek had previously had the market 
to himself, he now had rivals. Chief among 
them was another doctor-cum-collector: a 
Frenchman named Luc Labenne. “I knew he 
was going to be my competition,” Piatek says. 
“Then, it was on.” 

LABENNE HAS A SMALL PIECE OF MARS 
in the pocket of his bright red pants. He came 
to Casablanca, Morocco, this past September 
for the annual meeting of the Meteoritical 
Society. Like many collectors and dealers, 
Labenne cares deeply about the science and 
spends much of his time at the conference 
diligently attending the scientific sessions— 
especially the dozen talks that focus on Black 
Beauty. But he is also here to do business. He 
has just paid a local dealer nearly €10,000 for 
a 25-gram martian meteorite called a sher- 
gottite. The most common type of martian 
meteorite, shergottites are relatively cheap 
and not nearly as scientifically interesting 
as Black Beauty, but this particular rock’s 
delicate fusion crust should boost its value. 
“It’s just magnificent,” he says. “It’s black and 
fresh. So great.” 

Like Piatek, Labenne is a semiretired 
physician. But unlike the frenetic Piatek, 
Labenne is contemplative and keeps his 
own counsel. He often takes several trips a 
year to Libya, Morocco, Chile, and Oman to 
hunt for meteorites with his brother. Driving 
slowly and systematically across the desert 
from sunup until sundown, he takes a quiet 
pleasure in scanning the landscape for dark 
objects that don’t belong. At night, he uses 
LED lights to search for meteorites near the 
campsite— and has found two lunar meteor- 
ites this way. Labenne owns seven pieces of 
Black Beauty, the second most after Piatek, 
and has been a primary source of material 
for scientists in Europe. 

During a coffee break at the conference, 
Labenne lurks quietly behind Habibi, the 
flashy chain-smoking impresario of Moroc- 
co’s meteorite dealers. Habibi has a suitcase 
open and is showing off samples to scientists, 
unwrapping them from foil or old socks. He 
was hooked on fossils and meteorites as a 
teenager, when he explored the quarries in 
his hometown, Erfoud, for ammonites, the 
nautilus-shaped mollusks that lived hun- 
dreds of millions of years ago. Habibi now 
operates a 140-room hotel in Erfoud, but his 
Rock of ages 

Black Beauty is a breccia, or a rock made of smaller rocks— as this slice of the original 
320-gram mass reveals. 




main love is meteorites— and the wheeling 
and dealing in them. 

In 2011, Habibi was in the town of Agadir, 
recovering from gallstone surgery, when he 
met another dealer who offered him a shiny 
black stone. No one could agree what it was, 
or was worth, and Habibi passed on it. But as 
he was driving away, he had second thoughts. 
He turned around and made a deal for the 
first, 320-gram piece of Black Beauty. 

With meteorites, where there is one, there 
are usually more. In 2012, as reports of 
Black Beauty’s significance began circulating, 
Habibi went searching. He worked 
upstream from the Agadir dealer and dis- 
covered that the original finder was a nomad 
who worked the desolate terrain between 
Mauritania and southwestern Morocco. The 
nomad, whose name is Bahba, had picked up 
the stone from a void, featureless place called 
Rabt Sbayta (“sand dunes of Sbayta”), a few 
hundred kilometers from the coast within 
the disputed territories of Western Sahara. 

Habibi wasn’t the only dealer making inquiries. 
Rabt Sbayta soon became a village, 
with dozens of encampments and hundreds 
of people combing the shifting dunes. The 
rules were simple: finders keepers. A “strewn 
field” emerged: a stretch of desert roughly 
10 kilometers long and a few kilometers wide 
strewn with pieces that fell as Black Beauty 
broke up in the final moments of its descent. 
Newfound stones were sold up through the 
food chain of dealers like Habibi and usually 
wound up in the hands of Piatek or Labenne. 
“It was a big race,” Habibi says. “It was the 
most expensive stone of my life. We were 
paying $1000 per gram here in Morocco. I 
have never been so stressed.” 

Meteorite collectors, dealers, and scientists 
are intertwined in wary but symbiotic rela- 
tionships. Science gains from a long-standing 
“20/20” rule: To get a meteorite name and an 
official classification— which makes a stone 
more valuable— a dealer or collector must 
hand over to a scientific institution 20 grams 
or 20% of the weight of the stone, whichever 
is less. Scientists have now studied and clas- 
sified seven of the fallen Black Beauty stones. 
But the rest— including most of Piatek and 
Labenne’s stones— are unclassified, as the 
owners have little incentive to give up the 
material gratis. 

Hasnaa Chennaoui, a meteoriticist at 
Hassan II University in Casablanca who 
organized the conference, complains that 
the 20/20 rule rewards scientists in Europe 
and North America, where collectors have 
the most money and scientists the best 
equipment. Her university lacks an elec- 
tron microprobe— the primary tool used for 
classification— and so she understands why 
collectors and dealers tend to go to West- 
ern scientists. But she feels that she and her 
1. Fusion crust: The shiny, scaly skin 
comes from a fiery fall and years of 
desert sandblasting. 

2. Large pebbles: The rounded outline of 
pebbles suggests erosion in wind or water. 

3. Blueberries: These iron-oxide concretions 
may have precipitated out of water. 


Moroccan colleagues ought to have a chance 
to study their own country’s meteorites. So 
far, she has had no research access to Black 
Beauty. “Yes, it goes to science,” she says, “but 
to science outside of Morocco.” 

ALL THE PLAYERS in the world of meteor- 
ites have their incentives. Hunters want to 
make money. Collectors want to make a com- 
plete set. And scientists want to make their 
name. That holds for Florida State’s Huma- 
yun, who knew that Black Beauty was going 
to be special from the moment Agee spoke 
at the 2012 Lunar and Planetary Science 
Conference. In his talk, Agee presented the 
results of the isotope analysis for oxygen as 
well as hydrogen. The hydrogen isotopes
were consistent with Mars-like water. But the
oxygen isotope ratios were puzzlingly different
from those in all other martian meteorites.
Humayun thought he had the solution.
After the talk, he jumped up and asked for a
sample. “Carl said he’d give me one, but he
wasn’t in a hurry,” he says. “I just wanted to
get my hands on it, to tell the truth.”

4. Igneous rocks: There are many
igneous rock pieces, or clasts, that originally
formed in cooling magma chambers.

5. Impact melts: Glassy teardrops
and beads are impact melts, a sign that
asteroid impacts were important in
fusing the rock together.

Tidy in dress and excitable in manner,
Humayun is a perfect foil to the laid-back
Agee, who says he wasn’t trying to sideline
Humayun from access to Black Beauty; he
just didn’t get around to sending him any
in time. So Humayun arranged to work on
a sample held by the National Museum of
Natural History in Paris. He realized that
the skewed oxygen isotope ratios could be
explained if the rock had been exposed to
martian air or water for a long period. That
couldn’t have happened in Agee’s
explosive-eruption origin scenario, because the rock’s
interior would have been quickly sealed off
from the elements.

“It struck me,” Humayun says, “that 
this was not a volcanic breccia.” Instead, 
Humayun argues, Black Beauty is a sedimen- 
tary rock. The mineral and rock ingredients 
had formed igneously, from cooling magma, 
but they then eroded into sediments that 
were altered by water and air. Sometime 
later, asteroid impacts fused the breccia into 
a solid rock, before a final impact launched it 
into space. Humayun published his conclu- 
sions in Nature in November 2013. 

The Nature paper also reassessed Black 
Beauty’s age. Agee’s team had pegged it at 
2.1 billion years, based on a radiometric dat- 
ing of rubidium and strontium, averaged 
across the sample in bulk. That age would 
have made Black Beauty much older than 
the shergottites, which are on average a few 
hundred million years old, but not nearly 
as old as the 4.1-billion-year-old Allan Hills 
84001, a softball-sized Mars meteorite that 
sparked lasting controversy in 1996, when 
some scientists claimed it contained fossilized
bacteria. In Humayun’s scenario, however,
different components of the rock could
have formed at different times.

Humayun’s team analyzed rock pieces 
rich in pyroxene and feldspar— minerals that 
would have been among the first to settle out 
of a cooling magma chamber. In these re- 
gions they found zircons, hardy little crystals 
that are valued because they suck up enough 
radioactive uranium to be used for dating. 
The age Humayun’s team obtained from 
these minerals— 4.4 billion years— would 
make Black Beauty, or at least these regions 
within it, the very oldest martian meteorite. It 
would also mean that, just a hundred million 
years after Mars itself coalesced as a molten 
ball of rock, its crust had cooled enough to 
support magma chambers in which zircons 
could form. 
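
Dating of this kind rests on the radioactive decay law. The sketch below is a simplified illustration, not the procedure Humayun’s team used: it assumes a single decay chain (uranium-238 to lead-206), a closed crystal with no initial lead, and the conventional decay constant for uranium-238. Real zircon work combines more than one decay chain and applies corrections that are not modeled here, and the function names are hypothetical.

```python
import math

# Conventional decay constant of uranium-238, per year (assumed for this sketch).
LAMBDA_U238_PER_YEAR = 1.55125e-10


def age_from_ratio(pb206_per_u238: float,
                   decay_constant: float = LAMBDA_U238_PER_YEAR) -> float:
    """Age in years from the daughter/parent atom ratio: t = ln(1 + D/P) / lambda."""
    if pb206_per_u238 < 0:
        raise ValueError("ratio must be non-negative")
    return math.log1p(pb206_per_u238) / decay_constant


def ratio_from_age(age_years: float,
                   decay_constant: float = LAMBDA_U238_PER_YEAR) -> float:
    """Daughter/parent atom ratio expected after age_years: D/P = exp(lambda * t) - 1."""
    return math.expm1(decay_constant * age_years)


if __name__ == "__main__":
    ratio = ratio_from_age(4.4e9)
    print(f"Pb-206/U-238 after 4.4 billion years: {ratio:.3f}")
    print(f"Age recovered from that ratio: {age_from_ratio(ratio):.3e} years")
```

Under these assumptions a 4.4-billion-year-old zircon would hold nearly as many lead-206 atoms as remaining uranium-238 atoms, a ratio of about 0.98.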

Now scientists are pressing further into 
the rock. Every cut and slice seems to yield
a surprise. In September, at the Meteoritical
Society meeting in Casablanca, Humayun 
reported finding veins of pyrite, fool’s 
gold. The veins crosscut other parts of the 
meteorite— meaning they must have been 
among the last features to form while the 
rock was sitting on Mars— and the mineral is 
intrinsically interesting because it forms in
the presence of water. 

Also at the meeting, Francis McCubbins, 
a colleague of Agee’s at the University of New 
Mexico, presented evidence that the rounded 
appearance of the large faint pebbles sug- 
gests erosion by water or wind. CT scans 
have also revealed smaller iron-rich spher- 
ules resembling “blueberries,” the iron-oxide 
concretions discovered on Mars a decade
ago by the Opportunity rover and thought 
to have precipitated out of water. The edges 
of these veins and spherules would all be 
good places to look for organic signals, says 
Andrew Steele, a biogeochemist at the Carn- 
egie Institution for Science in Washington, 
D.C., who is probing the rock for organics. 

So far, Steele has found no hint of mar- 
tian biology— just trace amounts of organic 
molecules associated with volcanic pro- 
cesses. But he has found plenty of Earth 
bugs in the cracks— something that he 
takes as a good sign. “It’s a very 
habitable rock,” he says. “All it 
needs is a little warmth.” 

While the scientists continue 
to poke and prod, Piatek totes his 
martian meteorites to events and 
institutions around the world. He 
is a loving but not always careful guardian. 
In 2013, after the Tucson, Arizona, Gem 
and Mineral Show— a mecca for meteorite 
collectors— Piatek visited the house of 
Robert Ward, a meteorite hunter, dealer, 
and collector in Prescott, Arizona. The two 
started drinking and playing around with 
Ward’s new night vision goggles. They fig- 
ured that Black Beauty’s fragments, full of 
heat-retaining minerals, would stand out in 
the darkness like living things. So Piatek 
hid the stones outside in a freshly fallen 
snow. “We got done with the tequila and 
then remembered the Black Beauties hang- 
ing out in the snow,” Ward recalls. “It was 
like an Easter egg hunt,” Piatek says, chuck- 
ling. “It was crazy!” 


River of rocks 

In a nowhere place called Rabt Sbayta, 
meteorite hunters found pieces of Black 
Beauty across a strewn field more than 
10 kilometers long. 


To hear a podcast with author Eric Hand,
see http://scim.ag/pod_6213.
TOWARD THE END OF DINNER at the Indianapolis
steakhouse, Piatek receives a
text from his niece-slash-personal assistant.
“We’re in!” he says. He asks the waitress to
bag up our scraps of beef for the 13 dogs he 
and his wife keep at home. 

Brock’s sanctioned break-in has done the 
job. Piatek unzips the roller carry-on bag and 
begins to take out his pieces of Black Beauty, 
one by one. Accidentally, he drops one and
picks it up a bit sheepishly. “Oh, yeah, Smith- 
sonian would have died on that one,” he says. 

Piatek is handling the oldest rocks on
Earth, hundreds of millions of
years older than any rock native
to our planet, where the constant
churn of plate tectonics renews
the crust. Black Beauty waited
billions of years to get to Earth.
Then, all in the last few years,
pieces of it traveled to Rome; Washington,
D.C.; Tucson; Tempe; Albuquerque; Toronto:
a meteoritical Grand Tour. As Piatek
continues to make donations and trades, and 
as researchers slice and dice the stones into 
ever smaller chips, Black Beauty is spreading 
itself, entropically, across the world. Mars to 
Earth, rock to dust. 

Piatek shows me his favorite Black Beauty 
party trick. He takes the 525-gram main 
mass and the 241-gram piece and a little 7.7- 
gram shard, and twists the three around un- 
til, like a jigsaw puzzle, they lock perfectly 
into place— the way they must have been 
before the parent body broke up in the at- 
mosphere. “Look at that,” he says, marveling 
at the fit. “Isn’t that crazy?” In this moment, 
the arrow of time spins backward, and you 
can imagine all the pieces coalescing in 
reverse motion, gathering heft and speed 
and youth as they return from whence they 
came. Earth to Mars, dust to rock. 

A few months later, I call Piatek and catch 
him creaky-voiced, lying in the bed of a hotel 
room in Bally’s casino in Las Vegas, Nevada. 
Things have not gone well lately, and not just 
at the craps table. He and his wife are di- 
vorcing. His kids have left the nest. He’s not 
going to renew his Indiana Pacers season 
tickets. He is resolving to lead a simpler life, 
spending more time with family and friends. 
It also means shedding more of his meteor- 
ite collection. But not Black Beauty. That’s 
his touchstone, his bedrock— monetarily and 
metaphysically. It has both the pedigree of 
4.4 billion years and the promise of a future. 

“Some Black Beauty I’m going to have to 
keep forever,” he says. “For the kids, or for 
when I die. Because they might find some- 
thing in it.” 

“What if they find life? It’s the ‘what 
if’ is what I’m into. It’s like wow. That’s 
the cool thing about it: the potential of 
Black Beauty.” ■ 

PERSPECTIVES 


ECOLOGY 


Disentangling global soil fungal diversity

A worldwide sampling effort 
reveals the global drivers of 
soil fungal biodiversity 

By David A. Wardle and Björn D. Lindahl

Ecologists have long sought to understand
global patterns of biological diversity
(1, 2). Most work on this topic
has focused on visible aboveground 
organisms that can easily be counted, 
such as birds, butterflies, reptiles, 
and plants. In contrast, knowledge of the 
global ecology of most belowground organ- 
ism groups is limited because of their mi- 
croscopic size and hidden existence. Rapid 
advances in molecular techniques for ana- 
lyzing soil communities are now offering 
unprecedented opportunities for understanding
soil biodiversity (3, 4). On page
1078 of this issue, Tedersoo et al. (5) use 
pyrosequencing of soil samples to provide 
a comprehensive global study of a major 
group of soil organisms: soil fungi. 

Soil ecologists have long been aware of the need to test whether global
diversity patterns established for aboveground biota also apply to soil
biota (6, 7). However, attempts to understand the global ecology of soil
biota have typically either synthesized data from different and often
disparate studies, or included too few independent sites to usefully
separate the effects of large-scale drivers such as geography, latitude,
and macroclimate from local variation in soil properties. Tedersoo et al.
overcome this problem by characterizing fungal communities in soil samples
from 365 separate locations worldwide (including all continents except
Antarctica), all of which were sampled, processed, and analyzed in
exactly the same way.

Fungal diversity. Tedersoo et al. (5) show that the ratio of fungal to
plant diversity rises with increasing distance from the equator.
Consistent with this finding, high-latitude boreal forests support few
tree and understory plant species, yet fungal diversity is high, as shown
in the diverse range of fruiting bodies (including those of
ectomycorrhizal, saprotrophic, and pathogenic fungi) found within a
~150-m radius in Lunsen forest near Uppsala, central Sweden.

The results indicate that at the global scale, mean annual precipitation
is the strongest driver of the richness of fungal operational taxonomic
units (“richness”). However, soil properties, notably soil pH and calcium
concentration, also had important positive effects. Soil fungi are
generally viewed as acidophiles (relative to bacteria), but the current
results suggest that they have a wider range of tolerance, rather than a
preference, for acidic conditions (8).

Tedersoo et al. found that the relative richness of the main functional
groups (ectomycorrhizal fungi, saprotrophs, and pathogens) varies widely
among Earth’s major biomes, consistent with these groups each being
driven by a separate set of factors. Ectomycorrhizal fungal richness is
most strongly related to the richness of host plant species and high soil
pH; saprotroph richness is positively related to mean annual
precipitation; and pathogen richness is negatively related to latitude
but positively related to nitrogen availability.

The decline of species richness with increasing latitude is one of the
most pronounced patterns in the natural world, at least above ground (9).
In line with this, total fungal richness increases toward the equator,
but major groups of fungi defy this pattern. For example, ectomycorrhizal
fungal richness is greatest at mid- to high northern latitudes
(coinciding with temperate and boreal forest), and richness within
several ascomycete groups (notably the Leotiomycetes, which include fungi
that form mycorrhizal associations with ericoid dwarf shrubs) increases
toward the poles. Globally, fungal richness does not decline as sharply
as plant species diversity with increasing latitude; the result is that
the ratio of fungal to plant richness rises exponentially toward the
poles. Fungi are thus a key component of total terrestrial biodiversity
at high latitudes, with important implications for conservation (see the
figures). Reliable estimates of this ratio are important for deriving
global fungal diversity from measures of plant diversity (10).

Hidden diversity. The highly diverse fungal fruiting bodies shown in the
previous figure were photographed during a brief walk through this
Swedish forest, which supports low plant diversity.

Tedersoo et al.’s analysis also highlights the roles of biogeography and
evolutionary history in driving fungal communities within regions. First,
fungal taxa at higher latitudes on average had larger geographic ranges
than did those nearer the equator, pointing to higher fungal endemism in
tropical regions. This is supportive of Rapoport’s rule, which predicts
more restricted geographic ranges for lower-latitude taxa and has often
been demonstrated for aboveground biota. Second, through analyses aimed
at assessing shared fungal diversity among different biomes or regions,
the authors could identify similarities in communities between Southern
Hemisphere land masses, between Northern and Southern Hemisphere
temperate regions, and between paleo- and neotropical regions. These
biogeographic patterns can be explained in part through comigration with
hosts over Pleistocene land bridges and in part through long-distance
dispersal by spores.
The main functional groups of fungi drive many ecological processes, and
questions therefore remain as to how the global variation in fungal
communities characterized by Tedersoo et al. interacts with other
ecosystem components. For example, saprotrophs and mycorrhizal fungi play
opposite roles in soil organic matter formation (11, 12). Shifts in their
relative dominance may therefore have major implications for net
ecosystem carbon exchange and sequestration. Furthermore, vegetation and
soil fungi are tightly linked via mycorrhizal symbiosis, pathogenic
interactions, and nutrient release during decomposition. Shifts in the
relative diversity of major fungal groups could thus potentially affect
plant productivity and diversity (13). Moreover, soil fungi serve as food
for many soil invertebrates. It remains to be shown whether the variation
in fungal communities characterized by Tedersoo et al. is reflected in
corresponding global patterns in soil nematodes, mites, and springtails.

Global studies of the type performed by Tedersoo et al. are also helpful
for better understanding how the belowground subsystem may respond to
global environmental change. Improved knowledge about links between
macroclimate and fungal communities will help to predict how global
climate change is likely to affect the relative abundances of key fungal
groups and thereby alter fungal-driven ecological processes. Further,
because most of the soil fungal community remains undescribed, we are
currently unable to assess with any reliability the extent to which
subsets of this community are likely to be threatened or to be spread
(and potentially become invasive) through human activity. Studies such as
that by Tedersoo et al. that enable better characterization of the soil
mycobiome may serve as benchmarks against which we can assess
rearrangement of species assemblages caused by human activity. ■

REFERENCES 

1. M. Huston, Biological Diversity: The Coexistence of Species on Changing Landscapes (Cambridge Univ. Press, Cambridge, 1994).
2. K. J. Gaston, T. M. Blackburn, Pattern and Process in Macroecology (Blackwell Science, Oxford, 2000).
3. N. Fierer et al., Science 342, 621 (2013).
4. J. M. Talbot et al., Proc. Natl. Acad. Sci. U.S.A. 111, 6341 (2014).
5. L. Tedersoo et al., Science 346, 1256688 (2014).
6. D. A. Wardle, Communities and Ecosystems: Linking the Aboveground and Belowground Components (Princeton Univ. Press, Princeton, NJ, 2002).
7. T. Decaens, Glob. Ecol. Biogeogr. 19, 287 (2010).
8. J. Rousk et al., Soil Biol. Biochem. 42, 926 (2010).
9. P. H. Taylor, S. D. Gaines, Ecology 80, 2474 (1999).
10. D. L. Taylor et al., Ecol. Monogr. 84, 3 (2014).
11. K. E. Clemmensen et al., Science 339, 1615 (2013).
12. C. Averill, B. L. Turner, A. G. Finzi, Nature 505, 543 (2014).
13. W. H. Van der Putten et al., J. Ecol. 101, 265 (2013).

10.1126/science.aaa1185
28 NOVEMBER 2014 • VOL 346 ISSUE 6213 1053


Published by AAAS


INSIGHTS 


MEDICINE 

Big data meets public health 

Human well-being could benefit from large-scale data if large-scale noise is minimized 


By Muin J. Khoury and John P. A. Ioannidis

In 1854, as cholera swept through London, John Snow, the father of modern
epidemiology, painstakingly recorded the locations of affected homes.
After long, laborious work, he implicated the Broad Street water pump as
the source of the outbreak, even without knowing that a Vibrio organism
caused cholera. “Today, Snow might have crunched Global Positioning
System information and disease prevalence data, solving the problem
within hours” (1). That is the potential impact of “Big Data” on the
public’s health. But the promise of Big Data is also accompanied by
claims that “the scientific method itself is becoming obsolete” (2), as
next-generation computers, such as IBM’s Watson (3), sift through the
digital world to provide predictive models based on massive information.
Separating the true signal from the gigantic amount of noise is neither
easy nor straightforward, but it is a challenge that must be tackled if
information is ever to be translated into societal well-being.

The term “Big Data” refers to volumes of large, complex, linkable
information (4). Beyond genomics and other “omic” fields, Big Data
includes medical, environmental, financial, geographic, and social media
information. Most of this digital information was unavailable a decade
ago. This swell of data will continue to grow, stoked by sources that are
currently unimaginable. Big Data stands to improve health by providing
insights into the causes and outcomes of disease, better drug targets for
precision medicine, and enhanced disease prediction and prevention.
Moreover, citizen-scientists will increasingly use this information to
promote their own health and wellness. Big Data can improve our
understanding of health behaviors (smoking, drinking, etc.) and
accelerate the knowledge-to-diffusion cycle (5).

But “Big Error” can plague Big Data. In 2013, when influenza hit the
United States hard and early, analysis of flu-related Internet searches
drastically overestimated peak flu levels (6) relative to those
determined by traditional public health surveillance. Even more
problematic is the potential for many false alarms triggered by
large-scale examination of putative associations with disease outcomes.
Paradoxically, the proportion of false alarms among all proposed
“findings” may increase when one can measure more things (7). Spurious
correlations and ecological fallacies may multiply. There are numerous
such examples (8), such as “honey-producing bee colonies inversely
correlate with juvenile arrests for marijuana.”

From validity to utility. Big Data can improve tracking and response to
infectious disease outbreaks, discovery of early warning signals of
disease, and development of diagnostic tests and therapeutics.
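
Why the share of false alarms can rise as more things are measured is easy to illustrate with expected counts. The sketch below is a simplified model and not the analysis in reference 7: it assumes a fixed number of real associations, a conventional significance threshold, and a fixed chance of detecting each real association. All names and numbers are hypothetical.

```python
def expected_false_alarm_share(n_tested: int, n_true: int,
                               alpha: float = 0.05, power: float = 0.8) -> float:
    """Expected fraction of 'significant' findings that are false alarms."""
    if not 0 <= n_true <= n_tested:
        raise ValueError("n_true must lie between 0 and n_tested")
    # Null associations that cross the threshold purely by chance.
    false_alarms = alpha * (n_tested - n_true)
    # Real associations that the study manages to detect.
    true_hits = power * n_true
    flagged = false_alarms + true_hits
    return false_alarms / flagged if flagged else 0.0


if __name__ == "__main__":
    # Ten real associations (an assumption) hidden among ever more candidates.
    for n in (100, 1_000, 10_000, 100_000):
        share = expected_false_alarm_share(n, n_true=10)
        print(f"{n:>7} associations tested -> false-alarm share {share:.1%}")
```

Holding the number of real signals fixed while widening the search means chance hits grow in step with the data, so they come to dominate the list of findings.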

The field of genomics has addressed this problem of signal and noise by
requiring replication of study findings and by asking for much stronger
signals in terms of statistical significance. This requires the use of
collaborative large-scale epidemiologic studies.
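
One common way to demand a “much stronger signal” is to shrink the significance threshold in proportion to the number of tests. The text does not say which correction genomics studies use; the Bonferroni adjustment below is assumed here purely as a familiar, conservative example, and the test count is hypothetical.

```python
def bonferroni_threshold(n_tests: int, family_alpha: float = 0.05) -> float:
    """Per-test p-value threshold that keeps the family-wide error rate at family_alpha."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return family_alpha / n_tests


if __name__ == "__main__":
    n_tests = 1_000_000  # hypothetical number of variants tested in one study
    threshold = bonferroni_threshold(n_tests)
    print(f"Per-test threshold for {n_tests:,} tests: {threshold:.1e}")
    # With the adjusted threshold, the expected number of chance hits
    # across all null tests stays at or below family_alpha.
    print(f"Expected chance hits if every test is null: {n_tests * threshold:.2f}")
```

The price of such a strict threshold is that only large studies have the power to clear it, which is one reason large collaborative studies are needed.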


For nongenomic associations, false alarms
due to confounding variables or other biases 
are possible even with very large-scale stud- 
ies, extensive replication, and very strong 
signals (5). Big Data’s strength is in finding 
associations, not in showing whether these 
associations have meaning. Finding a signal 
is only the first step. 

Even John Snow needed to start with a 
plausible hypothesis to know where to look, 
i.e., choose what data to examine. If all he 
had was massive amounts of data, he might 
well have ended up with a correlation as 
spurious as the honey bee-marijuana con- 
nection. Crucially, Snow “did the experi- 
ment.” He removed the handle from the 
water pump and dramatically reduced the 
spread of cholera, thus moving from correla- 
tion to causation and effective intervention. 

How can we improve the potential for Big Data to improve health and
prevent disease? One priority is that a stronger epidemiological
foundation is needed. Big Data analysis is currently largely based on
convenient samples of people or information available on the Internet.
When associations are probed between perfectly measured data (e.g., a
genome sequence) and poorly measured data (e.g., administrative claims
health data), research accuracy is dictated by the weakest link. Big Data
are observational in nature and are fraught with many biases such as
selection, confounding variables, and lack of generalizability. Big Data
analysis may be embedded in epidemiologically well-characterized and
representative populations. This epidemiologic approach has served the
genomics community well (10) and can be extended to other types of Big
Data.

There also must be a means to integrate knowledge that is based on a
highly iterative process of interpreting what we know and don’t know from
within and across scientific disciplines. This requires knowledge
management, knowledge synthesis, and knowledge translation (11). Curation
can be aided by machine learning algorithms. An example is the ClinGen
project (12) that will create centralized resources of clinically
annotated genes to improve interpretation of genomic variation and
optimize the use of genomics in practice. And new funding, such as the
Biomedical Data to Knowledge awards of the U.S. National Institutes of
Health, will develop new tools and training in this arena.
Another important issue to address is that Big Data is a
hypothesis-generating machine, but even after robust associations are
established, evidence of health-related utility (i.e., assessing balance
of health benefits versus harms) is still needed. Documenting the utility
of genomics and Big Data information will necessitate the use of
randomized clinical trials and other experimental designs (13). Emerging
treatments based on Big Data signals need to be tested in intervention
studies. Predictive tools also should be tested. In other words, we
should embrace (and not run away from) principles of evidence-based
medicine. We need to move from clinical validity (confirming robust
relationships between Big Data and disease) to clinical utility
(answering the “who cares?” health impact questions).

As with genomics, an expanded translational research agenda (14) for Big
Data is needed that goes beyond an initial research discovery. In
genomics, most published research consists of either basic scientific
discoveries or preclinical research designed to develop health-related
tests and interventions. What happens after that in the bench-to-bedside
journey is a “road less traveled” with <1% of published research (15)
dealing with validation, evaluation, implementation, policy,
communication, and outcome research in the real world. Reaping the
benefits of Big Data requires a “Big Picture” view.

Bringing Big Data to bear on public health is where the rubber meets the
road. The combination of a strong epidemiologic foundation, robust
knowledge integration, principles of evidence-based medicine, and an
expanded translational research agenda can put Big Data on the right
course. ■

MATERIALS DESIGN

Simulation as a tool
for biopolymers design

New materials made from biobased raw materials can
be formulated faster with computer simulation of their
reaction chemistry

By Galen J. Suppes

Many polymers are made from simple small molecules (monomers) derived
through many processing steps from petroleum. Biologically sourced
materials are more sustainable, but to be cost competitive, new products
will likely need to be based on different structures composed of natural
oils or alcohols that have undergone minimal processing. For example,
plant-derived oils can be made into polymers, but these oils can bear
many more functional groups in variable positions, which poses
considerable challenges to controlling their reaction chemistry. The use
of computer simulation based on experimental inputs offers a way to
control this complexity and transform materials design, just as process
simulators have transformed engineering design. This perspective presents
a case that simulation is ready to change the way we research, develop,
and design thermoset polymer recipes (formulations).

THERMOSET POLYMERS. The spray foam insulation dispensed from pressurized
cans for filling cracks and crevices is an example of a thermoset
polymer. The two components, isocyanate and polyol monomers, combine
during the spray and react to form a polyurethane; the degree of foaming
is controlled by in situ gas generation from reactions (e.g., water and
isocyanate) or evaporation. Unlike thermoplastics, which can be heated
and then molded, thermosets adopt a final fixed shape.

The degrees of freedom of a typical urethane foam formulation include
selecting types and amounts of monomers (4 to 7 degrees of freedom),
catalysts (2 to 4), surfactants (2), and blowing agents (2 to 4).
Biobased materials (1-5), fillers, and fire retardants can add additional
degrees of freedom. Historically, the handling of these 10 to 23 degrees
of freedom in specifying a urethane has been much like an “art” in the
hands of a Ph.D. chemist with decades of experience. Predicting the
reaction progress is not simple because of the many simultaneous physical
and chemical processes. Hundreds of customer-specific urethane
formulations depend on the proper combinations of these parameters and
must meet constraints on product performance, raw material costs, and
throughput (i.e., cure time).

USE OF SIMULATION. Chemical process simulators have transformed the
design of many reaction and separation processes when applications have
large numbers of degrees of freedom, simultaneous solution of multiple
governing equations, and large markets that provide good returns on
investment in the simulation process. Simulation is used for both
designing new processes and controlling existing processes. With
cumulative markets exceeding $20 billion per year, urethane foam
formulations share these characteristics.

Just right. Curing a polyurethane foam is like baking bread: it must trap
gas bubbles in order to rise (B). If the “dough” is too runny (too low in
viscosity), bubbles escape (A); too stiff (too high in viscosity), the
gas is trapped and no bubbles form (C).

During simulation, an ordinary differen- 
tial equation solver is used to simultaneously 
solve dozens of component reaction rates, 
mass and energy balances, and mass-transfer 
rates. The output starts with concentration, 
temperature, degree of polymerization, foam 
density and height, and viscosity profiles. 
From these profiles, the success or failure 
of a foam formulation can be projected. The 
first figure shows how the viscosity increase 
of the polymer must be coordinated with
temperature increase (evaporation of blow- 
ing agent). If viscosity increases too fast, the 
blowing agent is entrapped in the resin; too 
slow, and percolating bubbles escape to the 
atmosphere. 
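
The kind of calculation described above can be sketched in a few lines. The example below is a deliberately stripped-down illustration, not the simulator discussed in this perspective: it assumes a single second-order isocyanate-hydroxyl reaction with Arrhenius kinetics in an adiabatic cup, integrates it with a hand-written fourth-order Runge-Kutta step, and leaves out viscosity, blowing-agent evaporation, and mass transfer. Every parameter value is a hypothetical placeholder.

```python
import math

R_GAS = 8.314  # gas constant, J/(mol K)

# Hypothetical placeholder parameters, not measured values.
DEMO_PARAMS = {
    "A": 50.0,         # pre-exponential factor, m^3/(mol s)
    "Ea": 40000.0,     # activation energy, J/mol
    "heat": 70000.0,   # heat released per mole of urethane formed, J/mol
    "rho_cp": 1.98e6,  # volumetric heat capacity of the mixture, J/(m^3 K)
}


def rates(state, params):
    """Time derivatives of (isocyanate conc., hydroxyl conc., temperature)."""
    c_nco, c_oh, temp = state
    k = params["A"] * math.exp(-params["Ea"] / (R_GAS * temp))
    r = k * c_nco * c_oh                     # reaction rate, mol/(m^3 s)
    d_temp = r * params["heat"] / params["rho_cp"]  # adiabatic heating
    return (-r, -r, d_temp)


def rk4_step(state, dt, params):
    """Advance the state by one classical Runge-Kutta step of size dt."""
    def shifted(slope, h):
        return tuple(s + h * ds for s, ds in zip(state, slope))

    k1 = rates(state, params)
    k2 = rates(shifted(k1, dt / 2), params)
    k3 = rates(shifted(k2, dt / 2), params)
    k4 = rates(shifted(k3, dt), params)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate(params, c0=2000.0, temp0=298.0, dt=0.5, t_end=600.0):
    """Return a list of (time s, isocyanate, hydroxyl, temperature K) rows."""
    state = (c0, c0, temp0)
    profile = [(0.0, *state)]
    for i in range(1, int(round(t_end / dt)) + 1):
        state = rk4_step(state, dt, params)
        profile.append((i * dt, *state))
    return profile


if __name__ == "__main__":
    for t, c_nco, _, temp in simulate(DEMO_PARAMS)[::200]:
        print(f"t = {t:5.0f} s   isocyanate = {c_nco:7.1f} mol/m^3   T = {temp:6.1f} K")
```

A fuller model would add equations for blowing-agent evaporation and a viscosity correlation tied to the degree of polymerization, so that the timing of viscosity rise and gas release could be compared directly.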

Ultimately, even greater detail on properties such as thermal
conductivity, compressive strength, and stress-strain relations is
desired. Simulation can guide the modification of formulations to
accommodate alternative monomers, catalysts, or blowing agents that might
improve performance or lower costs. Basic kinetic and physical property
data are needed to simulate the effect of new formulation components.
Kinetic information can be obtained by evaluating the temperature
profiles of components in gel tests, which are simplified formulations
without blowing agents (6).


A computer simulation that can predict 
the performance of numerous mixtures 
based on the performance of a limited num- 
ber of pure components allows for rapid and 
relatively inexpensive analysis of how new 
components affect performances of exist- 
ing formulations. If the simulator accurately 
predicts the performance profiles of an exist- 
ing formulation, it will likely also predict the 
effect of a formulation change. For the ure- 
thane industry, formulations are tested with 
a series of cup or box mold foam samples fol- 
lowed by testing and final adjustment with 
pilot scale or production line equipment. 
Simulation can substantially reduce or elim- 
inate laboratory work. 

Reducing the time and expense of adopting new biobased polymers can be
critical for their adoption, and simulation can create evolutionary paths
in polymer development that would otherwise not occur. Unlike petroleum
polyols such as V490 that are built typically from ethylene oxide and
propylene oxide with a small number of hydroxyl groups (2 to 6), soybean
oil has 6 to 7 carbon-carbon π bonds through which chemistry is used to
add reactive moieties and control oligomerization. A practical
application of the effect of gel temperature profiles is illustrated with
the soy-based polyol (A205) (see the second figure). Compared with the
petroleum-based monomer V490, A205 has a lower heat release associated
with its lower hydroxyl number and is more reactive. This higher
reactivity was unexpected and provided the opportunity to reduce the
catalyst loading from 0.3 to 0.12 weight percent, which cannot be done
for the less reactive V490.

Comparing synthetic and natural sources

Simulations can help lower costs. Comparison of petroleum-based polyol
(V490) to soy-based polyol (A205). Dashed lines are for catalyst loading
of 0.3%, and solid lines are for catalyst loading of 0.12%. The more
reactive soy polyol will provide similar performance (based on the
temperature-time profiles) but will use less of the relatively expensive
catalyst.

FUNDAMENTAL INSIGHT. Temperature 
profiles can confirm that the anticipated 
urethane-forming reactions are occurring. 
Ghoreishi et al. (7) showed that polyol re-
activity can be specified by identifying the
fractions of primary, secondary, and hin- 
dered secondary alcohols. The kinetic and 
thermodynamic parameters of the polyol 
are then specified by its functionality (num- 
ber of alcohol moieties per molecule), the 
concentration of alcohol moieties (hydroxyl 
number), and these fractions. This approach 
reduces the number of parameters needed to 
characterize a polyol, and such analysis veri- 
fies that the polyol performs as anticipated. 
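
The link between alcohol fractions and a gel-test temperature profile can be sketched numerically. The Python below is a minimal illustration, not the simulator discussed in the text: it assumes second-order urethane kinetics, adiabatic heating, and explicit Euler integration. The Arrhenius constants, heat of reaction, and heat capacity are hypothetical placeholders, chosen only so that primary alcohols react faster than secondary and hindered secondary ones.

```python
import math

# Hypothetical Arrhenius parameters per alcohol class:
# (pre-exponential factor in L/(mol*s), activation energy in J/mol).
# Placeholders only; they are not measured values from any cited study.
KINETICS = {
    "primary": (1.0e3, 4.0e4),
    "secondary": (1.0e3, 4.4e4),
    "hindered_secondary": (1.0e3, 4.8e4),
}
GAS_CONSTANT = 8.314       # J/(mol*K)
HEAT_OF_REACTION = 7.0e4   # J per mol of urethane formed (assumed)
VOL_HEAT_CAPACITY = 2.0e3  # J/(L*K) of the reacting mixture (assumed)


def gel_temperature_profile(fractions, oh_total=4.0, nco=4.0,
                            t_start=298.0, t_end=600.0, dt=0.1):
    """Return [(time_s, temperature_K), ...] for an adiabatic gel reaction.

    fractions maps each alcohol class to its share of the hydroxyl groups;
    oh_total and nco are starting concentrations in mol/L.
    """
    oh = {name: oh_total * share for name, share in fractions.items()}
    temperature = t_start
    profile = [(0.0, temperature)]
    steps = int(round(t_end / dt))
    for step in range(1, steps + 1):
        reacted = 0.0
        for name, conc in list(oh.items()):
            pre_exp, e_act = KINETICS[name]
            k = pre_exp * math.exp(-e_act / (GAS_CONSTANT * temperature))
            consumed = k * conc * nco * dt  # second-order rate, Euler step
            oh[name] = conc - consumed
            reacted += consumed
        nco -= reacted
        # All reaction heat goes into warming the mixture (no losses).
        temperature += HEAT_OF_REACTION * reacted / VOL_HEAT_CAPACITY
        profile.append((step * dt, temperature))
    return profile
```

Calling `gel_temperature_profile({"primary": 0.3, "secondary": 0.7})` gives a curve that could be laid over a measured gel-test profile; under these assumptions, a larger primary fraction makes the curve rise sooner.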

Atypical polyols may have reactive moi-
eties other than alcohols, may have nitrogen
heteroatoms that provide catalytic ability, or 
may be marketed as mixtures including cat- 
alysts. Concentration profiles generated by 
the simulation provide insight into the pro- 
cess that is not available from experimental 
study alone. Simulation of the blowing agent 
allows the height (density) of the foam to be 
estimated and fine-tuned. Density is a pri- 
mary indicator of performance and a key 
characteristic matched with application. 

AN ENABLING TECHNOLOGY. An exciting 
aspect of simulation of thermoset polymer- 
ization is the ability to enable fundamental 
sciences to more directly impact commercial 
product development. Niche successes in 
this application can expand and even trans- 
form how new materials are created. The 
databases and standards of performances 
established by simulators can form a foun- 
dation for advancing science and engineer- 
ing that exceed alternatives like archival 
refereed publications. Simulators represent 
a key step in the evolution of science and 
technology when fields reach a certain point 
of sophistication. ■ 
Peeking into Earth's interior. This schematic cross section shows the layered structure and mineralogical model
of Earth. The mantle transition zone is defined by a rapid increase in seismic velocities bracketed by the 410- and 
660-km seismic discontinuities. The lower mantle, which makes up the largest volume portion of Earth, consists of 
predominantly bridgmanite and ferropericlase. The key to understanding the structure and dynamics of the deep 
earth is to understand the physical and chemical properties of bridgmanite. 


EARTH SCIENCE 

Bridgmanite-named at last 

The most abundant mineral in Earth’s interior gets a name 


By Thomas Sharp 

The most abundant solid phase in
Earth’s interior, making up 38% of
Earth’s volume, is magnesium iron
silicate (Mg,Fe)SiO3. This mate-
rial is known to form a perovskite
structure. However, no samples of
the mineral can be obtained from Earth’s
lower mantle; without a well-characterized
natural sample, it has not been possible to
formally name the mineral. On page 1100
of this issue, Tschauner et al. (1) use syn-
chrotron micro-x-ray diffraction and elec-
tron probe microanalysis to determine the
crystal structure and composition of natu-
ral (Mg,Fe)SiO3-perovskite in a shocked
L chondrite (a stony, low-iron meteorite). 
They name it bridgmanite, after the father 
of high-pressure experiments, Percy W. 
Bridgman (1882 to 1961). 

A mineral is a naturally occurring crys-
talline solid with distinct, but not fixed,
chemical composition. According to the
commission on new mineral names, one 
must characterize the structure and compo- 
sition of a natural sample to give it a mineral 
name. Previous reports of natural MgSiO3-
perovskite have shown that it can occur in
shocked meteorites (2, 3), but in both cases, 
the material was damaged by irradiation 
during transmission electron microscope 
analysis, and the structure had not been 
well characterized. Tschauner et al.’s careful
characterization finally allows this impor- 
tant mineral to be named. 

Earth scientists know the physical struc- 
ture of the deep Earth mainly from seismic 
studies, which have shown that seismic 
velocities change suddenly at particular 
depths. These seismic discontinuities show 
that Earth has a layered structure consist- 
ing of a solid inner core, a liquid outer core, 
a lower mantle, and an upper mantle (see 
the first figure). To determine which solid 
materials make up these layers of the inner 
Earth, we need to know how Earth’s mate- 
rials react to extreme pressures and what 
structures are stable under the extreme 


conditions of pressure and temperature 
within Earth (4). This information is gained 
from a combination of high-pressure ex- 
periments, thermodynamics, and modeling 
studies, with important confirmation from 
natural samples, usually (although not al- 
ways) from meteorites. 

High-pressure experiments use devices 
such as the diamond-anvil cell (DAC) and 
the multi-anvil apparatus to generate the
extreme pressures of planetary interiors. 
The groundwork for these experiments was 
laid by Bridgman. More than 100 years ago, 
he invented a high-pressure apparatus and 
pressure seal that solved the problem of 
sample containment (5). With this press, 
Bridgman was able to compress materials 
to 7000 atmospheres (atm). By incorporat- 
ing first a hydraulic press and then carbide 
anvils, he could eventually reach pressures 
of 50,000 to 100,000 atm (5 to 10 GPa) (6). 
Bridgman’s opposing-anvil design with a 
cylindrical gasket led to the development 
of the DAC, which has allowed mineral 
physicists to reproduce the conditions 
found in the deep Earth and other plan- 
etary interiors (4). 
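
The unit conversion behind these figures is easy to check. The short Python sketch below relies only on the definition of the standard atmosphere (101,325 Pa); the function name is illustrative.

```python
ATM_TO_PA = 101_325.0  # pascals per standard atmosphere, exact by definition


def atm_to_gpa(pressure_atm):
    """Convert a pressure in standard atmospheres to gigapascals."""
    return pressure_atm * ATM_TO_PA / 1.0e9
```

With this factor, 50,000 and 100,000 atm come to roughly 5.07 and 10.13 GPa, which the text rounds to 5 to 10 GPa, and Bridgman's earlier 7000 atm corresponds to about 0.7 GPa.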

The combination of experimental mineral 
physics, thermodynamics, and first-princi-
ples theory has provided a detailed model
of the inner Earth that is consistent with 
geophysical observations and a bulk-earth 
chemistry resembling that of chondrites (see 
the first figure). High-pressure experiments 
provide a direct means of determining the 
stabilities of various structures as a function 
of pressure, temperature, and composition, 
as well as physical properties such as elas- 
ticity. From mineral physics, we know that 
the upper mantle consists of familiar min- 
erals such as olivine, pyroxenes, and garnet. 
However, in the mantle transition zone, 
higher-density spinel and garnet structures 
predominate. In the lower mantle, the even 
denser perovskite and rock-salt structures 
are stable. Bridgmanite (perovskite struc- 
ture) differs from upper mantle and crustal 
minerals in that silicon fills octahedral
rather than tetrahedral sites. The high-den- 
sity structure of bridgmanite is unstable at 
ambient pressures, where it readily trans- 
forms to a low-density glass if heated. 

Some materials from the deep Earth, 
such as diamonds, are delivered to Earth’s 
surface, but most deep-mantle silicates do 
not survive the journey to the surface. An 
exception is a sample of ringwoodite, which
reached the surface as an inclusion in dia-
mond and provided evidence for substantial
amounts of water in the mantle transition
zone (7). Nearly all other natural examples 
of deep-mantle minerals come from the 
asteroid belt (8). Hypervelocity impacts on 
meteorite parent bodies cause short pulses 
Deep-mantle minerals in a meteorite. This shock
melt vein in the L chondrite, Roosevelt County 106,
consists of quenched chondritic melt (nearly black
matrix) with entrained meteorite fragments. The olivine
in the fragments has transformed into deep-blue
ringwoodite. The image is about 1 mm wide.

of very high pressures and locally high tem- 
peratures that melt and transform meteor- 
itic materials into the same high-pressure 
phases that make up the deep Earth. During 
shock, the impacted rocks are compressed to 
pressures of more than 25 GPa. In highly de-
formed regions, such as shear bands or col-
lapsed pores, sample temperatures exceed
the melting temperatures of the materials
(~2500 K at 25 GPa), resulting in shock-melt
veins and pockets (9). It is in and adjacent to 
these melt zones that olivine, pyroxene, and 
plagioclase get hot enough to transform into 
their high-density polymorphs. 

Shocked meteorites have provided natu- 
ral examples of deep-mantle minerals (see 
the second figure) since Binns et al. (10) dis-
covered ringwoodite, the spinel-structured
polymorph of olivine, in a shocked chondrite 
and named it after A. E. Ringwood. Many 
high-pressure minerals have been named 
after important high-pressure experimen- 
talists. Bridgmanite is thus a very fitting 
name for the most abundant mineral in the 
Earth. Continued investigation of shock ef- 
fects in meteorites and terrestrial rocks will 
likely provide many more natural examples 
of minerals from deep within Earth or other 
planetary bodies. ■ 
PSYCHOLOGY

How quickly we forget 

Why do we recall some things better than others? 


By David C. Rubin 

Since the beginning of the scientific
study of human memory, there has
been a tension between those who
stress experimental control, even at
the risk of oversimplifying and thus
losing aspects of the behavior of in-
terest (1), and those who stress studying
behavior as close as possible to the way it
exists naturally, with only enough control to
produce replicable results (2, 3). The first ap-
proach optimistically assumes that the the-
ory tested in the laboratory will generalize
to more varied situations; the second coun-
ters that a careful, preferably quantitative,
description of the phenomenon is needed
(close to its full complexity) before a reason-
able theory can be formulated, otherwise
important aspects may be removed before 
a theory is considered. When it is possible, 
as in the study by Roediger and DeSoto (4) 
on page 1106 of this issue, combining both 
approaches is the most productive strategy. 

Roediger and DeSoto examined the pat- 
tern and rate at which individuals (in the 
United States) forget U.S. presidents over 
time. The authors examined two groups— 
three generations of students (totaling 415 
individuals in 1974, 1991, and 2009) from 
different U.S. universities, and 497 adults in
2014. Among their observations is a similar 
pattern of enhanced recall of the first U.S. 
presidents (Washington, Adams, and Jef- 
ferson), of Lincoln, of the current president 
(at the time of testing), and of presidents 
close in time to the time of the test (attrib- 
uted to the “recency effect”). On the basis of 
their results, Roediger and DeSoto extrapo- 
late when the fame of recent presidents, as 
measured by the ability of people to recall 
them, will reach the level of Presidents Fill-
more, Buchanan, and McKinley (who were
not well remembered).

Remember the president? Across generations (since
the 1970s), people in the United States forget U.S.
presidents in the same way, with strongest recall only
for the earliest and most recent presidents (but also for
Lincoln) (4).

Knowledge of events (graph; x axis: age at time of event).
Awards and events. Correct answers to multiple-choice
questions are shown as a function of participants' age
at the time of the event. The vertical dotted line denotes
the birth of the participants. The y axis begins at the
chance guessing level of 20%. [Adapted from (5).]

The findings of the study overall suggest
an integration of personal memory and the
memory of major historical events. This is
because of an observed continuity in mem-
ory for historical events (U.S. presidents)
that occurred before and after one’s birth.
The recall data of Roediger and DeSoto de-
crease smoothly with time from the pres-
ent that does not change dramatically at
birth or during early development, a time 
period from which adults can recall few 
memories. For a variety of topics, including 
current events, entertainment, and sports, 
the difference between one’s life and his- 
torical time is not a major factor (5). Two 
examples are winners of Academy Awards 
and important news events (see the graph). 
Acquisition of such historical knowledge is 
from exposure that need not come at the 
time of the event and in which the number 
and spacing of repeated exposures and the 
mechanisms of human cognition can make 
some events memorable beyond their his- 
torical importance (5-8). 

There are two explanations for the smooth 
decrease in recall from the present observed 
by Roediger and DeSoto that are based on 
when and how often a person is reminded of 
an event, not on other factors. At a systems 
level, the probability that particular knowl- 
edge will be needed can be estimated from its 
recent past usage, with remote usage having 
a smaller effect. This past usage can be used 
to adjust its availability for recall— much as 
a library might arrange books for ease of ac- 
cess by recent popularity— but also consider- 
ing, to a lesser degree, remote usage (9). In 
addition, based on past usage, the number 
and recency of past exposures could be all 
that affects ease of recall. Thus, if one has re- 
peatedly heard about something in the past 
few months, it is more likely to be recalled. 
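
The idea that recent usage counts for more than remote usage can be made concrete with a small Python sketch. It assumes one common formalization, in which every past exposure leaves a trace that decays as a power of its age and the traces are summed. The decay exponent of 0.5 and the function name are illustrative assumptions, not values taken from the studies cited here.

```python
def need_strength(exposure_times, now, decay=0.5):
    """Sum of power-law-decayed traces, one per past exposure.

    exposure_times and now share one time unit (for example, days);
    decay is an assumed exponent, and larger values forget faster.
    Exposures at or after `now` are ignored.
    """
    return sum((now - t) ** -decay for t in exposure_times if t < now)
```

Under this assumption, three mentions of a president in the past ten days outweigh three mentions from about ninety days earlier, and an additional old exposure still adds a little to the total, in line with the library analogy.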

However, beyond the impact of the envi- 
ronment, there is considerable knowledge 
of memory mechanisms from studies of 
information learned and retained over the 
life span and over generations, which can 
be used to understand why some things are 
recalled better than others. A key factor is 
the nature of the longer-term spacing of “re-
hearsal.” For instance, Bahrick and Hall (8)
found minimal loss in knowledge of high 
school algebra over 50 years among people 
who took at least one college course at or 
above calculus level, whereas those who did 
equally well in high school algebra but took 
no college mathematics declined to near- 
chance levels of remembering algebra. Simi- 
larly, Bahrick (7) found that individuals who 
took Spanish courses in high school or dur- 
ing college (spaced over multiple semesters) 
but were not exposed to Spanish thereafter 
lost much of that knowledge within the first 
6 years after their final Spanish class. This 
observed recall as a function of time is simi- 
lar to that observed by Roediger and DeSoto. 
Similarly, annual holidays (such as Veterans 
Day in the United States) and more widely 
spaced commemorations (such as might oc- 
cur at 10-, 25-, or 100-year intervals after the 
end of a war) provide this spaced rehearsal 
for historical events. 

With respect to cognitive mechanisms, 
memories of historical events, people, and 
even monuments meant to serve as remind- 
ers are all affected more by visual and spatial 
imagery than by other factors such as narra- 
tive (6). The combination of cues from vari- 
ous sensory modalities, emotion, language, 
narrative, and especially visual and spatial 
imagery of scenes helps keep memory avail- 
able and stable for centuries, well above 
what would be expected from the mnemonic 
effects of individual cognitive systems used 
in isolation (10-12). 

By combining careful measurement tech- 
niques with the study of events that span 
centuries, Roediger and DeSoto demon- 
strate the potential for the scientific study 
of the transmission and retention of cultural 
knowledge based both on exposure and the 
mechanisms that shape human memory. ■ 
CELL BIOLOGY

Powering the 
cell cycle 

Cell division is linked to 
mitochondrial protein 
transport 

By Christian Schulz and Peter Rehling

To live means to grow, respond, re-
produce, and adapt. All these pro- 
cesses require energy, which in most 
eukaryotic cells is provided by mi- 
tochondrial oxidative phosphoryla- 
tion. Moreover, mitochondria have 
been implicated in other vital cellular pro- 
cesses, including programmed cell death 
(apoptosis) and calcium signaling. Hence, 
mitochondrial functions have to be tightly 
integrated into the cellular context, and the 
mechanisms that participate in this coordi- 
nation are only just beginning to be under- 
stood. On page 1109 of this issue, Harbauer 
et al. (1) identify a link between the cell divi- 
sion cycle and mitochondrial protein trans- 
port as a driver of this process. 

To maintain mitochondrial functional- 
ity and thus ensure cellular survival, newly 
synthesized mitochondrial proteins must
be imported into the organelle from the
cytoplasm. Numerous pathways supply the
various mitochondrial subcompartments 
with their resident proteins. How these 
transport processes are integrated into the 
cellular context and adapted to the cellular 
requirements is only recently becoming ap- 
parent (2, 3). An unexpected observation is 
that mitochondrial protein translocation is 
regulated by cytosolic and mitochondrial 
enzymes (protein kinases). A major phos-
phorylation target of these kinases is the
translocase of the outer membrane (TOM), 
which represents the entry gate for almost 
all protein precursors destined for the 
mitochondria. 

Tom6 is a constituent of the TOM com- 
plex, and its expression (at the transcription 
level) undergoes a cell cycle-dependent reg- 
ulation (see the figure) (4). Thus, Harbauer 
et al. investigated the fate of the Tom6 pro- 
tein during the cell cycle in budding yeast. 
A marked increase of Tom6 during the 
phase-M phase transition in the cell cycle 
led to the identification of a cyclin-depen-
dent kinase 1 (Cdk1)- and cyclin Clb3-me-
diated phosphorylation event on serine-16
of cytosolic Tom6 protein. This modifica-
tion increased the rate of import of Tom6
Mitochondria and mitosis. During M phase of the cell division cycle, TOM6 mRNA and protein abundance increase. Cdk1-Clb3 phosphorylates Tom6, thereby improving its targeting to
mitochondria. This increases TOM biogenesis, which imports Mgm1 and Fzo1, proteins that stimulate energy production to support efficient progression through M phase.


into the mitochondrial outer membrane. 

Tom6 displays characteristic features of 
a tail-anchored protein. The mitochondrial 
import protein 1 (Mim1) promotes insertion
of Tom6 into the outer mitochondrial mem-
brane. Tom6 is then assembled into the TOM
complex, a process mediated by the sorting
and assembly machinery (2). Mim1 itself is
regulated by a casein kinase 2-dependent
phosphorylation event that stimulates its
ability to import other constituents of the
TOM complex (Tom20 and Tom70) (2). This
interconversion of Mim1 function allows
coupling of mitochondrial import capacity 
to the metabolic activity of the cell. 

The increase in steady-state amounts of 
membrane-integrated, phosphorylated Tom6 
promotes the biogenesis of the TOM com- 
plex. Surprisingly, an increase in the amount 
of TOM complex does not lead to a notable 
acceleration in precursor translocation per 
se; instead, only import of a small subset of 
substrates, namely mitochondrial genome
maintenance 1 (Mgm1) and fuzzy onions 1
(Fzo1), is stimulated. How is this substrate
selectivity explained? The phosphorylated
cytosolic domain of Tom6 recognizes the tar-
geting signal of Mgm1, thereby facilitating
its targeting and subsequent import. In the
case of the outer membrane protein Fzo1, a
different mechanism applies; its transmem- 
brane segment and the intermembrane space 
loop are recognized by Tom6. As previously 
observed, the increased presence of these
proteins in the mitochondria stimulates res-
piration and consequently, energy output (5).
Harbauer et al. suggest that this mitochon-
drial supply of energy powers progression
through mitosis (6), particularly the energy-
demanding steps of cytokinesis. Intriguingly,
Mgm1 and Fzo1 are components of the fusion
machinery of the inner and outer mitochon-
drial membrane, respectively (7, 8). It may
be that these proteins ensure equilibrium of
mitochondrial fusion and fission, which is
critical for controlled distribution of the or-
ganelle during mammalian cell division (9).

Whereas Harbauer et al. describe how mi- 
tochondrial functions are adapted to cellular 
demands through signaling pathways, di- 
rect physical contact between mitochondria 
and other organelles also coordinates this 
responsive interaction. Such contacts have 
been observed for decades, although their 
functions and molecular make-up have only 
recently been explored (10). Direct connec-
tion between the endoplasmic reticulum and
mitochondria is thought to benefit calcium
signaling, lipid metabolism, mitochondrial
division and distribution during cell division,
and autophagosome formation (11, 12). Con-
tacts between mitochondria and the plasma
membrane, peroxisomes, and the vacuole
have been found, but their roles remain
mysterious (10, 13, 14). These direct contacts
allow for integration of mitochondria with 
cellular processes in other parts of the cell. 

The findings of Harbauer et al. now ex- 
tend our views on how mitochondrial func- 
tion can be regulated. At the same time, 
the study raises questions about further 


communication mechanisms between mi- 
tochondria and the signaling pathways 
that drive, and control, cell division. Sig- 
naling processes apparently help to adapt 
mitochondrial activity to the requirements 
of mitosis through a modulation of TOM 
complex function. It is unclear if newly 
assembled TOM complexes are also con- 
trolled to specifically increase import of 
other substrates besides Mgm1 and Fzo1. It
is also unknown how mRNA abundance, es-
pecially of TOM6, is controlled. In addition
to the integration of mitochondria into cel- 
lular signaling networks during metabolic 
shift and the cell cycle, it can be expected 
that the same will hold true during cell dif- 
ferentiation (15). Revealing the signaling 
pathways regulating this integration will be 
a major area of future research. ■ 
ENVIRONMENT AND DEVELOPMENT

Universal education is key to 
enhanced climate adaptation 

Fund more educators rather than just engineers 


By Wolfgang Lutz, Raya Muttarak,

Erich Striessnig* 

Over the coming years, enormous
amounts of money will likely be 
spent on adaptation to climate 
change. The international commu- 
nity recently made pledges of up to 
$100 billion per year by 2020 for the 
Green Climate Fund. Judging from such cli- 
mate finance to date, funding for large proj- 
ects overwhelmingly goes to engineers to 
build seawalls, dams, or irrigation systems 
(1). But with specific projections of future
changes in climate in specific locations still
highly uncertain, such heavy concrete (in
both meanings) and immobile
investments that can lock coun-
tries into certain paths may not
be the best way to go (2). Our new study 
suggests that it may be efficient and effec- 
tive to give part of this fund to educators 
rather than engineers. Public investment in 
universal education in poor countries in the 
near future should be seen as a top priority 
for enhancing societies’ adaptive capacity 
vis-a-vis future climate change. 

Recent research suggests that general 
empowerment of populations through uni- 
versal primary and secondary education 
is not only essential to poverty alleviation 
and economic growth but also to reducing 
vulnerability to natural disasters (3, 4). It
is not unreasonable to assume that factors 
that helped reduce vulnerability to floods, 
tropical storms, and droughts over the past 
decades will help reduce future vulnerabil- 
ity to climate change. We present findings 
from the most comprehensive global-level 
assessment of the effects of education on 
disaster fatalities (measured as the logged 
number of deaths per million of popula- 
tion) from hydro-meteorological hazards 
that are likely to be intensified by climate 
change, e.g., floods, droughts, storms, and 
extreme temperatures. The data cover 167 
countries for the period 1970 to 2010. Data 
on disasters come from the Emergency 
Events Database (EM-DAT), which provides 
the best available information on the num-
ber of disasters and reported fatalities from
around the world (5). 

EDUCATE FEMALES, REDUCE FATALI- 
TIES. Because the literature on disaster vul- 
nerability has conventionally emphasized 
economic growth while disregarding educa- 
tion, our statistical analysis focuses on the 
relative assessment of these two factors as 
measured by Gross Domestic Product (GDP) 
per capita and the proportion of women 
aged 20 to 39 with at least junior secondary 
education. The latter was shown to be a good 
indicator for recent improvements in human 
capital in other contexts (3). 

To account for differences in the fre- 
quency of natural hazards experienced and 
size of the countries affected, we include as 
controls the number of registered disasters 
per population, total arable land area, a 
dummy variable for being landlocked, the 
recent rate of population growth to capture 
stress on infrastructure, and 43 regional 
fixed effects for countries with comparable 
settings and climate zones. As documented 
in the supplementary materials (SM) (table 
S1 and sensitivity analysis in table S2 and
fig. S1), several alternative model specifica-
tions combined with different estimation
techniques resulted in very consistent find- 
ings: When estimating the relative effects of 


income and education in the same models, 
GDP per capita turns out to be insignifi- 
cant, whereas female education is highly 
significant across all models with the ex- 
pected negative sign. Hence, this empirical 
analysis of national-level time series clearly 
indicates that female education is indeed 
strongly associated with a reduction in di- 
saster fatalities. 
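
How income can lose significance once education enters the same model can be illustrated with a toy regression. The Python below is a sketch on purely synthetic numbers, unrelated to EM-DAT or to the authors' estimates. It assumes a made-up world in which education drives both income and fatalities, and fits ordinary least squares by solving the normal equations. It omits the authors' controls and regional fixed effects, and every coefficient and noise level is a hypothetical choice.

```python
import random


def solve_linear_system(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    solution = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * solution[k] for k in range(row + 1, n))
        solution[row] = (aug[row][n] - tail) / aug[row][row]
    return solution


def ols(rows, targets):
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    width = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(width)]
           for i in range(width)]
    xty = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(width)]
    return solve_linear_system(xtx, xty)


def synthetic_panel(n=2000, seed=0):
    """Synthetic country-period rows: [1, log GDP per capita, female education share]."""
    rng = random.Random(seed)
    rows, targets = [], []
    for _ in range(n):
        edu = rng.random()                                  # share with secondary schooling
        log_gdp = 7.0 + 3.0 * edu + rng.gauss(0.0, 0.5)     # income tracks education
        log_deaths = 5.0 - 2.0 * edu + rng.gauss(0.0, 0.3)  # only education matters here
        rows.append([1.0, log_gdp, edu])
        targets.append(log_deaths)
    return rows, targets


rows, targets = synthetic_panel()
intercept, beta_gdp, beta_edu = ols(rows, targets)
```

In this constructed example the fitted `beta_gdp` stays close to zero while `beta_edu` is clearly negative, even though income and fatalities are correlated when education is left out. It demonstrates the statistical logic only and says nothing about the size of the real effects.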

Assuming that this robust association 
between education and lower mortality risk 
from natural disasters will continue in the
future, we present alternative scenarios for 
future disaster-related fatalities as a func- 
tion of alternative future education and 
population trends. When studying the ef- 
fects of improvements in school enrollment 
on the human capital stock of the adult 
population, it is essential to account for 
significant inertia in the process of human 
capital formation. Because primary and ju- 
nior secondary education tend to happen 
almost exclusively during childhood, it will 
take several decades until an expansion of 
education among children translates into 
higher human capital for men and women 
around age 50. This process of human 
capital formation along cohort lines can be 
appropriately modeled using the tools of 
multidimensional demography (6). 
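
The inertia described here follows from simple cohort bookkeeping. The Python below is a deliberately stripped-down sketch, not the multidimensional model the authors cite: it assumes 10-year age groups, schooling fixed once a cohort enters, and no mortality, migration, or adult education. The function name and the example shares are hypothetical.

```python
def project_shares(initial_shares, entrant_share, decades):
    """Track the educated share of each 10-year age group over time.

    initial_shares lists the groups from youngest to oldest. Each decade
    every cohort moves up one group and keeps its share, the oldest group
    drops out, and a new youngest cohort enters with entrant_share.
    """
    current = list(initial_shares)
    history = [list(current)]
    for _ in range(decades):
        current = [entrant_share] + current[:-1]
        history.append(list(current))
    return history
```

For five groups spanning ages 10 to 59 that all start at a 20% share, raising the entrant share to 90% leaves the oldest group at 20% for four decades; it reaches 90% only in the fifth.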

This approach has recently been ap- 
plied to produce a new set of SSP (Shared 
Socioeconomic Pathways) scenarios for the 
international integrated assessment and 
vulnerability, risk, and adaptation research 
communities replacing the older Special 
Report on Emissions Scenarios which con- 
tained only total population size and GDP 
as socioeconomic variables (7). The SSPs 
were defined to address simultaneously the 
socioeconomic challenges to climate change 
mitigation and adaptation (8). Besides many
other economic and technological
variables, alternative population
scenarios by age, sex, and seven
levels of educational attainment
for all countries form the “human
core” of the full SSPs (9). SSP1 il-
lustrates the case of rapid social de-
velopment in all parts of the world
associated with rapidly expanding
education (see the first chart). SSP2
is the middle-of-the-road scenario
where current development trends
continue, while SSP3 anticipates
a fragmented world with stalled
socioeconomic development. The
figure also illustrates the great
inertia of progress in improving
educational attainment where, by
2035, the differences between the
scenarios are only evident for the
younger cohorts.

Projecting the population by level of education

The results of combining the estimated co-
efficients (table S1) with two contrasting SSP
scenarios (SSP1 and SSP3) for the rest of the
century are shown in the second chart. We did
this by taking the time-varying population
and education variables from the respective
SSPs. Different assumptions were made for
the frequency of disasters representing possi-
ble greater future hazards. The solid lines in
the second chart show the hypothetical case
of constant hazard (i.e., no climate change).
Under SSP1, this results in a significant de-
cline of disaster deaths because of underly-
ing progress in educational expansion. If we
assume stalled development, which also im-
plies higher fertility and thus higher popula-
tion size, we observe almost no change under
SSP3. The dashed lines assume an increase in
the number of hydro-meteorological extreme
events of on average 10% per decade (Climate
Change +10%). Although there is still a slight
reduction in future disaster deaths for SSP1,
we observe a strong increase according to
SSP3. The more extreme assumption of the
hazard increasing on average by 20% per 
decade (Climate Change +20%; dotted line) 
leads to an increase in future disaster deaths 
in the longer run for all SSPs, although to dif- 
ferent degrees. 
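
The size of these hazard assumptions is easier to judge once the growth is accumulated over the century. The Python sketch below assumes that the per-decade increase compounds and that the projection covers nine decades (2010 to 2100). Both are reading assumptions: the text gives only the average rate per decade.

```python
def hazard_multiplier(rate_per_decade, decades):
    """Cumulative growth in disaster frequency under compounding per-decade growth."""
    return (1.0 + rate_per_decade) ** decades
```

Under these assumptions, `hazard_multiplier(0.10, 9)` is about 2.36 and `hazard_multiplier(0.20, 9)` is about 5.16. By 2100 the frequency of extreme events would have more than doubled in the +10% case and roughly quintupled in the +20% case.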

COGNITIVE CAPACITY, SOCIAL SPILL- 
OVER. Our macrolevel finding that educa- 
tion reduces disaster-related mortality is 
consistent with evidence from recent empir- 
ical studies for different parts of the world 
and at different levels of analysis (from in- 
dividual-, household-, and community-level 
to global-level data). These studies dem- 
onstrate that education contributes to vul- 
nerability reduction and adaptive capacity 
enhancement in the predisaster phase and 
during disaster events and the disaster after- 
math [for review, see (2)]. 
Before a disaster, disaster mitigation ef-
forts like living in low-risk areas or undertak-
ing disaster preparedness measures, such as
stockpiling emergency supplies, are found to
be greater among more highly educated in-
dividuals and households (10). Similarly, loss
of life, injury, morbidity, and physical dam-
age from natural disasters were reported to
be lower in communities and countries with
a higher proportion of populations with at
least a junior secondary education (11). The
better educated were also found to cope 
better with both income loss and the psy- 
chological impacts of natural disasters (12). 
Most of these studies explicitly compare the 
effects of education to those of household in- 
come with education consistently emerging 
as more important. Given such systemati- 
cally strong associations and a sound causal 
narrative described below, there is firm 
ground to assume functional causality of the 
effects of education on reducing vulnerabil- 
ity. This implies that a continuation of this 
association in the future can be reasonably 
assumed. 

One important mechanism through 
which education influences human well-be- 
ing is neurocognitive development. Learn- 
ing basic literacy, numeracy, and abstraction 
skills enhances cognitive capacities through 
raising the efficiency of individuals’ cogni- 
tive processes and logical reasoning (13). 
Accordingly, because preventive action is 
initiated by stressors, such as perception of 
risk, followed by assessments of one’s abil- 
ity to respond to the threat, the more edu- 
cated tend to have greater risk awareness 
because of better understanding of the con- 
sequences of their actions, e.g., as found in 
the case of smoking and cancer prevention 
(14). In addition to these individual-level 
effects, there are also spillover effects of 
education at the community level as is evi- 
dent for the effect of female education on 


lowering infant mortality (15). Op- 
portunities of social interaction with 
more-educated members may speed 
up the diffusion of information and 
knowledge, or access to institutions 
that favor disaster risk reduction. 

Of course, in our study the associa- 
tion between educational level and 
disaster vulnerability has only been 
estimated on the basis of the past 
40 years and can change in the lon- 
ger-term future because of all kinds 
of uncertainties. Instead of assuming 
different percentage changes in the 
hazard as we did, more differenti- 
ated global climate models could be 
applied. But our calculations show 
a clear picture of the strong effects 
of empowerment through education 
on reducing disaster vulnerability 
and enhancing adaptive capacity to climate 
change, which is unlikely to change when us- 
ing more sophisticated models. Accordingly, 
given uncertainty about the precise manifes- 
tations of climate change in specific areas, it 
seems beneficial to increase general flexibil- 
ity and enhance human and social capital in 
order to empower populations to better and 
more flexibly cope with climate change in a 
way best for their long-term benefit. ■ 
Social media for large studies of behavior 


Large-scale studies of human behavior in social media need to be held to higher 
methodological standards 


By Derek Ruths and Jurgen Pfeffer 

On 3 November 1948, the day after 
Harry Truman won the United States 
presidential elections, the Chicago 
Tribune published one of the most 
famous erroneous headlines in 
newspaper history: “Dewey Defeats 
Truman” (1, 2). The headline was informed 
by telephone surveys, which had inadvertently 
undersampled Truman supporters 
(1). Rather than permanently discrediting 
the practice of polling, this event led to the 
development of more sophisticated tech- 
niques and higher standards that produce 
the more accurate and statistically rigorous 
polls conducted today (3). 

Now, we are poised at a similar techno- 
logical inflection point with the rise of on- 
line personal and social data for the study of 
human behavior. Powerful computational 
resources combined 
with the availability of massive 
social media data sets has given rise to a 
growing body of work that uses a combina- 
tion of machine learning, natural language 
processing, network analysis, and statistics 
for the measurement of population struc- 
ture and human behavior at unprecedented 
scale. However, mounting evidence suggests 
that many of the forecasts and analyses be- 
ing produced misrepresent the real world 
(4-6). Here, we highlight issues that are 
endemic to the study of human behavior 
through large-scale social media data sets 
and discuss strategies that can be used to 
address them (see the table). Although some 
of the issues raised are very basic (and long-studied) 
in the social sciences, the new kinds 
of data and the entry of a variety of communities 
of researchers into the field make 
these issues worth revisiting and updating. 

REPRESENTATION OF HUMAN POPULATIONS. 
Population bias. A common assumption 
underlying many large-scale social 
media-based studies of human behavior 
is that a large-enough sample of users will 
drown out noise introduced by peculiarities 
of the platform’s population (7). However, 
substantial population biases vary across 
different social media platforms (8). For instance, 
Instagram is “especially appealing to 
adults aged 18 to 29, African-American, La- 
tinos, women, urban residents” (9) whereas 
Pinterest is dominated by females, aged 25 to 
34, with an average annual household income 
of $100,000 (10). These sampling biases are 
rarely corrected for (if even acknowledged). 
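
One standard way to correct for a known population bias is post-stratification: compute the quantity of interest within each demographic group, then weight the groups by their share of the target population rather than by their share of the platform sample. The following is a minimal sketch under stated assumptions. The function name, the age groups, and every number in it are hypothetical and chosen only for illustration; none come from the studies cited here.

```python
def poststratify(group_means, sample_counts, population_shares):
    """Return (raw, adjusted) estimates of a population mean.

    raw      -- groups weighted by their size in the platform sample
    adjusted -- groups weighted by their share of the target population
    """
    total = sum(sample_counts.values())
    raw = sum(group_means[g] * sample_counts[g] for g in group_means) / total
    adjusted = sum(group_means[g] * population_shares[g] for g in group_means)
    return raw, adjusted


# Hypothetical illustration: a platform that over-represents young users.
group_means = {"18-29": 0.70, "30-49": 0.50, "50+": 0.30}        # support for some position
sample_counts = {"18-29": 600, "30-49": 300, "50+": 100}         # users in the platform sample
population_shares = {"18-29": 0.20, "30-49": 0.35, "50+": 0.45}  # assumed census shares

raw, adjusted = poststratify(group_means, sample_counts, population_shares)
print(f"raw estimate: {raw:.2f}, post-stratified estimate: {adjusted:.2f}")
```

In this made-up case the uncorrected sample suggests majority support, while the reweighted estimate does not. The correction is only as good as the demographic labels and population shares available, which for most platforms must themselves be inferred.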

Proprietary algorithms for public data. 
Platform-specific sampling problems also arise. For 
example, the highest-volume source of public 
Twitter data, which is used by thousands 
of researchers worldwide, is not an 
accurate representation of the overall platform’s 
data (11). Furthermore, researchers 
are left in the dark about when and how 
social media providers change the sam- 
pling and/or filtering of their data streams. 
So long as the algorithms and processes 
that govern these public data releases are 
largely dynamic, proprietary, and secret or 
undocumented, designing reliable and re- 
producible studies of human behavior that 
correctly account for the resulting biases 
will be difficult, if not impossible. Academic 
efforts to characterize aspects of the behav- 
ior of such proprietary systems can provide 
details needed to begin reporting biases. 


The rise of “embedded researchers” (re- 
searchers who have special relationships 
with providers that give them elevated ac- 
cess to platform-specific data, algorithms, 
and resources) is creating a divided social 
media research community. Such research- 
ers, for example, can see a platform’s inner 
workings and make accommodations, but 
may not be able to reveal their corrections 
or the data used to generate their findings. 

REPRESENTATION OF HUMAN BEHAV- 
IOR. Human behavior and online platform 
design. Many social forces that drive the 
formation and dynamics of human behavior 
and relations have been intensively studied 
and are well known (12-14). For instance, 
homophily (“birds of a feather flock to- 
gether”), transitivity (“the friend of a friend 
is a friend”), and propinquity (“those close 
by form a tie”) are all known by designers 
of social media platforms and, to increase 
platform use and adoption, have been incor- 
porated in their link suggestion algorithms. 
Thus, it may be necessary to untangle psy- 
chosocial from platform-driven behavior. 
Unfortunately, few studies attempt this. 

Social platforms also implicitly target 


Reducing biases and flaws in social media data 

DATA COLLECTION 
• 1. Quantifies platform-specific biases (platform design, user base, platform-specific behavior, platform storage policies) 
• 2. Quantifies biases of available data (access constraints, platform-side filtering) 
• 3. Quantifies proxy population biases/mismatches 

METHODS 
• 4. Applies filters/corrects for nonhuman accounts in data 
• 5. Accounts for platform and proxy population biases 
    a. Corrects for platform-specific and proxy population biases 
    OR 
    b. Tests robustness of findings 
• 6. Accounts for platform-specific algorithms 
    a. Shows results for more than one platform 
    OR 
    b. Shows results for time-separated data sets from the same platform 
• 7. For new methods: compares results to existing methods on the same data 
• 8. For new social phenomena or methods or classifiers: reports performance on two or more distinct data sets (one of which was not used during classifier development or design) 

Issues in evaluating data from social media. Large-scale social media studies of human behavior should 
address issues listed and discussed herein (further discussion in supplementary materials). 
and capture human behavior according to 
behavioral norms that develop around and 
as a result of the specific platforms. For instance, 
the ways in which users view Twitter 
as a space for political discourse affect how 
representative political content will be. The 
challenge of accounting for platform-specific 
behavioral norms is compounded by their 
temporal nature: They change with shifts 
in population composition, the rise and fall 
of other platforms, and current events (e.g., 
revelations concerning interest and tracking 
of social media platforms by intelligence ser- 
vices). In the absence of new methodologies, 
we must rely on assessments of where such 
entanglements likely occur. 

Distortion of human behavior. Develop- 
ers of online social platforms are building 
tools to serve a specific, practical purpose— 
not necessarily to represent social behavior 
or provide good data for research. So, the 
way data are stored and served can destroy 
aspects of the human behavior of interest. 
For instance, Google stores and reports final 
searches submitted, after auto-completion is 
done, as opposed to the text actually typed 
by the user (5); Twitter dismantles retweet 
chains by connecting every retweet back 
to the original source (rather than the post 
that triggered that retweet). There are valid, 
practical reasons for platforms to make such 
design decisions, but in many cases these 
either obscure or lose important aspects of 
the underlying human behavior. Quantifying 
and, if possible, correcting for these storage 
and access policies should be part of the data 
set reporting and curation process. 

Nonhumans in large-scale studies. Despite 
attempts by platform designers to police accounts, 
there are large populations of spammers 
and bots masquerading as “normal” 
humans on all major online social platforms. 
Moreover, many prominent individuals 
maintain social media accounts that are pro- 
fessionally managed to create a constructed 
image or even behave so as to strategically 
influence other users. It is hard to remove or 
correct for such distortions. 

ISSUES WITH METHODS. Proxy population 
mismatch. Every social media research ques- 
tion defines a population of interest: e.g., 
voting preference among California univer- 
sity students. However, because human pop- 
ulations rarely self-label, proxy populations 
of users are commonly studied instead, for 
example, the set of all Facebook users who 
report attending a UC school. However, the 
quantitative relation between the proxy and 
original populations studied, typically, is un- 
known— a source of potentially serious bias. 
A recent study revealed that this proxy effect 
has caused substantially incorrect estimates 
of political orientation on Twitter (8). 
Incomparability of methods and data. 
With few exceptions, the terms of usage for 
social media platforms forbid the retention 
or sharing of data sets collected from their 
sites. As a result, canonical data sets for 
the evaluation and comparison of compu- 
tational and statistical methods— common 
in many other fields— largely do not exist. 
Furthermore, few researchers publish code 
implementing their methods. The result is 
a culture in which new methods are intro- 
duced (and often touted as being “better”) 
without having been directly compared to 
existing methods on a single data set. Given 



platforms’ understandable sensitivity to user 
privacy and the competitive value of their 
data, the research community will likely im- 
prove method and result comparison issues 
more quickly by focusing on enforcing the 
sharing of methods at publication time. 

Multiple comparison problems. The body 
of social media analysis that concerns the de- 
velopment of user/content classification and 
prediction has unaddressed issues with over- 
fitting. Specifically, when building a com- 
putational machine that recognizes two or 
more classes (of users, for example), it is cus- 
tomary to introduce tens to hundreds of fea- 
tures as the basis for the classifier. At the very 
least, the performance of the classifier should 
take into account the number of features be- 
ing used. Of greater concern, however, is the 
extent to which the classifier performance is 
a result of “feature hunting”— testing feature 
after feature until one is found that delivers 
significant performance on the specific data 
set. Standard practices of reporting the P 
value for classifiers based on the number of 
features involved, as well as keeping a data 
set independent of the training set for final 
classifier evaluation, would work toward addressing 
these issues (15). 
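
The effect of “feature hunting” can be illustrated with a small simulation. In the sketch below, which is an illustration under stated assumptions and not a method from the studies discussed, both the class labels and all candidate features are pure random noise, so no feature carries any real signal. The user and feature counts, the seed, and all names are arbitrary choices. The code selects whichever feature (or its negation) best matches the labels on one half of the data, then scores that same feature on the half that was kept aside.

```python
import random


def accuracy(feature, labels, flip=False):
    """Fraction of users for whom the (optionally negated) binary feature equals the label."""
    hits = sum((f != flip) == y for f, y in zip(feature, labels))
    return hits / len(labels)


rng = random.Random(0)  # fixed seed so the run is repeatable
n_users, n_features = 200, 300

# Labels and features are independent coin flips: there is nothing to learn.
labels = [rng.random() < 0.5 for _ in range(n_users)]
features = [[rng.random() < 0.5 for _ in range(n_users)] for _ in range(n_features)]

hunt = slice(0, n_users // 2)            # data used while hunting for a feature
held_out = slice(n_users // 2, n_users)  # data never looked at during the hunt

# Try every feature and its negation; keep the one that looks best on the hunting set.
candidates = ((j, flip) for j in range(n_features) for flip in (False, True))
best_j, best_flip = max(
    candidates,
    key=lambda c: accuracy(features[c[0]][hunt], labels[hunt], c[1]),
)

hunt_acc = accuracy(features[best_j][hunt], labels[hunt], best_flip)
held_out_acc = accuracy(features[best_j][held_out], labels[held_out], best_flip)
print(f"accuracy on hunting set: {hunt_acc:.2f}")
print(f"accuracy on held-out set: {held_out_acc:.2f}")
```

Because the winner is chosen from hundreds of candidates, its accuracy on the hunting set will typically sit well above chance, while its held-out accuracy should hover near 50%. Only the held-out figure says anything about how the classifier would behave on new data.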

Multiple hypothesis testing. In an academic 
culture that celebrates only positive findings, 
a meta-issue emerges as multiple groups 
report successes in modeling or predicting 
a specific social phenomenon. Without see- 
ing the failed studies, we cannot assess the 
extent to which successful findings are the 
result of random chance. This issue has been 
observed when predicting political election 
outcomes with Twitter (16). We are not the 
only field struggling with this issue (17). Solutions 
to this problem could involve enabling 
the publication of negative results or requir- 
ing the use of more data sets in a single study 


(so as to permit the calculation of a signifi- 
cance score within the study itself). 
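
The role of unseen failed studies can be made concrete with a simple calculation. If k independent groups each test a method that in truth has no predictive power, and each uses a significance threshold alpha, the chance that at least one of them reports a "success" is 1 - (1 - alpha)^k. The sketch below is a simplified illustration: it assumes the studies are independent, and the function name and the example values of alpha and k are hypothetical.

```python
def chance_of_spurious_success(alpha, k):
    """Probability that at least one of k independent null studies is significant at level alpha."""
    return 1 - (1 - alpha) ** k


for k in (1, 5, 20):
    p = chance_of_spurious_success(0.05, k)
    print(f"{k:2d} studies -> {p:.2f} chance of at least one false positive")
```

With 20 such studies, a false positive somewhere is more likely than not, which is why a published success cannot be interpreted without knowing how many attempts went unreported.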

CONCLUSIONS. The biases and issues high- 
lighted above will not affect all research in 
the same way. Well-reasoned judgment on 
the part of authors, reviewers, and editors is 
warranted here. Many of the issues discussed 
have well-known solutions contributed by 
other fields such as epidemiology, statistics, 
and machine learning. In some cases, the 
solutions are difficult to fit with practical 
realities (e.g., as in the case of proper sig- 
nificance testing) whereas in other cases the 
community simply has not broadly adopted 
best practices (e.g., independent data sets for 
testing machine learning techniques) or the 
existing solutions may be subject to biases 
of their own. Regardless, a crucial step is to 
resolve the disconnect that exists between 
this research community and other (often 
related) fields with methods and practices 
for managing analytical bias. 

Moreover, although the issues highlighted 
above all have different origins and specific 
solutions, they share in common the need 
for increased awareness of what is actually 
being analyzed when working with social 
media data. ■ 
ECONOMICS 

First, do 
no harm 

By Robert M. May 

This book is based on a year-long series 
of meetings and seminars, organized 
by Nicholas Morris and David Vines 
at Oxford University, that brought 
together economists, lawyers, and 
philosophers to discuss ideas for ensuring 
greater accountability in the financial 
services sector. In particular, it addresses the 
question of why, in the banking and finance 
community, there is no formal “duty of care” 
as there is in the legal and medical professions. 
In those professions, the kind of behavior 
manifested by some in the financial 
community in recent years might well have 
resulted in disbarment. 

The book emphasizes that there have, of 
course, been previous banking crises in the 
United Kingdom and elsewhere, notably 
in 1792, 1825, 1857, 1907, and 1929. In ear- 
lier years, specifically before 1825, banks in 
England and Wales were owner-managed, 
with the consequence that a bank’s failure 
not only wiped out depositors but wiped out 
owners also. 

As in recent events, these earlier incidents 
put the financial system in jeopardy, and 
each prompted major reforms. It is, however, 
argued by The Economist’s Richard Davies in 
Chapter 4 that over these past two centuries 
such repeated cycles of crisis and reform 
served gradually to “drive a wedge between 
the economic interests of banks’ managers 
and their owners and creditors.” In Chapter 
3, corporate governance expert Thomas Noe 
and economist H. Peyton Young describe 
how reforms have come to limit the risk to 
bank owners and managers. The unintended 
and perverse consequence of these reforms 
is that it has become less rational for banks 
to take full account of such risks, which can 
now be imposed on clients. 

A substantial part of the problem arguably 
lies in the textbook assumption that selfish 
motives will produce sensible outcomes, 
a belief that dates back to Adam Smith’s 
Wealth of Nations. Data, such as there are, 
supply little support for this warmly embraced 
belief. Like much else in economic 
theory, this particular belief is in accord 
with Robert Skidelsky’s unkind suggestion 
that economics “is a form of post-Christian 
theology, with economists as priests of warring 
sects” (1). 

In retrospect, it is perhaps not surprising 
that increasing patterns of misallocation of 
risk and reward arose and initially seemed 
acceptable. The high charges levied by hedge 
funds— 2% of the fund under management, 
plus 20% of any excess of returns above a 
benchmark— seemed acceptable in good 
times, when both the investor and interme- 
diary are receiving high returns. But in bad 
times, although the intermediary gets no 
20%, he also suffers no loss. Losses are con- 
centrated on the investor. Capital markets’ 
bonus pools work in a similar way. 
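
The asymmetry in this fee structure is easy to see with a little arithmetic. The sketch below applies the 2% management charge and the 20% charge on returns above a benchmark described above. The fund size, the returns, the 5% benchmark, and the function name are hypothetical, picked only to illustrate the point.

```python
def manager_fees(assets, fund_return, benchmark=0.05, mgmt_rate=0.02, perf_rate=0.20):
    """Fees under a '2 and 20' structure: 2% of assets plus 20% of any return above the benchmark."""
    management_fee = mgmt_rate * assets
    excess_gain = max(0.0, fund_return - benchmark) * assets
    return management_fee + perf_rate * excess_gain


assets = 100.0  # hypothetical fund size, in millions

for label, fund_return in (("good year", 0.20), ("bad year", -0.20)):
    fees = manager_fees(assets, fund_return)
    investor_net = assets * fund_return - fees
    print(f"{label}: manager receives {fees:.1f}, investor nets {investor_net:.1f}")
```

In the good year the manager collects 5 and the investor keeps 15. In the bad year the manager still collects 2 while the investor is down 22, so the intermediary shares in the gains but not in the losses.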


Of course, the creators of hedge funds 
and the like did not set out to create grossly 
risky products. A large part of the problem 
was that the underpinning theory upon 
which calculations of such risks were based 
was grievously flawed, a concept that is 
elaborated upon in Chapter 2. Quoting Alan 
Greenspan, the former chairman of the U.S. 
Federal Reserve, the authors refer to the “vir- 
tually indecipherable complexity of ... prod- 
ucts and markets that developed with the 
advent of sophisticated mathematical tech- 
niques to evaluate risks” (2). The essential 
problem is that these techniques, although 
undoubtedly complex, were certainly not 
sophisticated. 

But to what extent did banks, hedge 
funds, and other intermediaries really understand 
the risks? It is surprising that so 
little note was taken of Warren Buffett’s 
statement, clearly expressed in 2002, that 
“derivatives are financial weapons of mass 
destruction” (3). Nor was he alone in this 
perception. In his book The Big Short: In- 
side the Doomsday Machine, Michael Lewis 
describes young investors who, after the recent 
financial crisis, ask, “Were these guys 
crooks or stupid?” An older and 
darker character answers, “Most 
of them were stupid, but some 
of them were crooks, and they 
tended to be higher up” (4). 

As noted in Capital Failure, and 
elsewhere by the Banking Com- 
mission, one important question 
is whether the competence and 
behavior of individuals working 
within the system would benefit 
from a professional oversight or- 
ganization. In Chapter 9, legal 
scholar Joshua Getzler observes 
that, even with the benefit of retro- 
spect, Goldman Sachs’s executives 
had difficulty acknowledging that 
it was inherently wrong to have 
the bank aggressively marketing 
investments to clients while taking 
a short position on the same in- 
vestments, thus taking handsome 
remuneration from both sides. Al- 
though it was at one time thought 
possible that some of the more 
egregious instances of such unpro- 
fessional behavior might result in 
jail sentences, this has not happened. The 
popular explanation for this is that mecha- 
nisms for such legal action are nonexistent; 
however, this argument has recently and de- 
cisively been demolished (5). 

REFERENCES 

1. R. Skidelsky, John Maynard Keynes: 1883-1946: 
Economist, Philosopher, Statesman (Penguin, London, 
2005). 

2. A. Greenspan, Brookings Pap. Econ. Act. 41, 201-246 
(2010). 

3. W. E. Buffett, Letter to the Shareholders of Berkshire 
Hathaway Inc. (2003); www.berkshirehathaway.com/ 
letters/2002pdf.pdf. 

4. M. Lewis, The Big Short: Inside the Doomsday Machine 
(Norton, New York, 2010). 

5. J. S. Rakoff, The financial crisis: Why have no high-level 
executives been prosecuted? New York Rev. Books (9 
January 2014). 

10.1126/science.1260794 



SCIENCE sciencemag.org 


28 NOVEMBER 2014 ■ VOL 346 ISSUE 6213 1065 


Published by AAAS 



PUBLIC HEALTH 


The tangled web of dengue fever 


By Kristin Harper 

According to the U.S. Centers for Disease 
Control and Prevention, one-third 
of the world’s population is at 
risk for contracting dengue fever (1), 
a mosquito-borne viral infection so 
painful it is sometimes referred to as 
“breakbone fever.” Although only nine coun- 
tries experienced severe dengue epidemics 
before 1970, today the disease is endemic in 
more than 100 countries (2). These dire sta- 
tistics are familiar to many in public health, 
but the story of dengue’s rise in the 21st 
century is less well known. Alex M. Nading’s 
Mosquito Trails: Ecology, Health, and the 
Politics of Entanglement is a timely explora- 
tion of dengue prevention efforts in Ciudad 
Sandino, a slum outside Nicaragua’s capital 
city, Managua. 

To tell this story, Nading, an anthropolo- 
gist by training, embedded himself in the 
lives of those on the front lines of the dengue 
epidemic. His interviewees range from the 
community health workers known as briga- 
distas, who go from house to house stamping 
out mosquito breeding grounds, to the gar- 
bage collectors who have become scapegoats 
for the epidemic, implicated for their role in 
providing mosquito breeding places. He also 
spent time with the epidemiologists charged 
with implementing control efforts and fami- 
lies affected by dengue. 

Nading offers a much-needed discussion 
of the broader context in which the den- 
gue epidemic is taking place. An environ- 
ment tailor-made for the virus and its vector 
has been produced by the global trade in 
garbage, the growth of peri-urban slums, 
changes in insecticide use, and an approach 
to public health that increasingly shifts the 
responsibility for mosquito control from the 
government to individual households. This 
book provides a fascinating, if meandering, 
tour of dengue and the changing face of pub- 
lic health in Nicaragua from the 1979 San- 
dinista revolution up to the current era of 
structural readjustment. 

Nading devotes a substantial part of the 
book to discussing “entanglement,” de- 
fined as “the unfolding, often incidental 
attachments and affinities, antagonisms 
and animosities that bring people, nonhu- 
man animals, and things into each other’s 
worlds.” Certainly, his descriptions of life in 
Ciudad Sandino illustrate the dense web of 
interactions between people, places, politics, 
mosquitos, and the dengue virus, provid- 
ing insight into why control is so challeng- 
ing. Unfortunately, the lengthy sections of 
the book devoted to entanglement are of- 
ten convoluted and disappointing when it 
comes to potential solutions. For example, 
Nading calls for a solution in which “basic 
and complex infrastructures, national and 
transnational belonging, and, above all, a 
recognition that environments and bod- 
ies are entangled, lead to a more relational 
ethic of health” but fails to elaborate on what 
such an approach might look like. Simi- 
larly, he argues that factors such as climate 
change and uneven urban development ne- 
cessitate a “holistic, even radical” solution 
to mosquito control, but again, we are left 
to wonder what such a solution might look 
like. Ultimately, it is not clear how entangle- 
ment, as an analytical framework, represents 
a conceptual advance over simply describing 
dengue control as complicated. 

Nading concludes the book with three 
practical considerations for those who would 
confront dengue. The first is the idea that 
politics and poverty are critical components 
of the disease process. The second is that 
community health workers are underpaid 
and underappreciated. His final recommendation 
for tackling vector-borne diseases is 
to consider the relationships between people 
and other creatures. All of these conclusions 
are well supported by the book’s vignettes. 
However, while Nading considers disrup- 
tion of current approaches to be a goal of his 
framework of entanglement, his conclusions 
are consistent with ideas long espoused by 
global health institutions, including the 
World Health Organization and the Rockefeller 
Foundation (3, 4). 

Although this book is light on solutions 
to the dengue epidemic, researchers and 
clinicians involved in dengue research and 
care, medical anthropologists, and anyone 
interested in the intersection of politics and 
global health will likely find this a worth- 
while read. Mosquito Trails documents the 
many challenges inherent in Nicaragua’s 
struggle with dengue and gives a voice to the 
residents of Ciudad Sandino who have found 
themselves entangled with virus, vector, and 
one another. 
Edited by Jennifer Sills 

Salamander protection 
starts with the newt 

IN THEIR REPORT “Recent introduction 
of a chytrid fungus endangers Western 
Palearctic salamanders” (31 October, p. 630), 
A. Martel et al. warned that a newly 
discovered pathogenic chytrid fungus 
Batrachochytrium salamandrivorans (Bs) 
likely originated in Asia and poses a threat 
to naive European and North American 
salamander populations. However, they did 
not address the conservation crisis facing 
the apparent vector of Bs, Asian salaman- 
ders in the family Salamandridae (“newts”). 

Eight years ago, one of us expressed 
concern for the fate of the Lao Newt (now 
Laotriton laoensis), a newly discovered 
species in Laos that was targeted by com- 
mercial collectors from Germany and Japan 
after it was inadvertently “advertised” by its 
formal description in a scientific journal (1). 
The species is now endangered in the wild 
(2), primarily due to continued overhar- 
vesting for the international pet trade (3). 
Laotriton is a member of the same phylogenetic 
clade that contains the three Asian 


species [genera Paramesotriton 
and Cynops (4)] that were 
identified by Martel et al. as being 
reservoirs for Bs. Our efforts to curb 
the commercial trade of Laotriton 
in Laos have had limited success. 

In 2008, the Lao government 
commendably listed it as the first 
amphibian species in the country to 
be protected from commercial trade 
(3). Unfortunately, Laotriton are small 
and easily smuggled across Laos’s porous 
international borders, and wildlife law 
enforcement in this relatively poor country 
is hindered by lack of resources (5). More 
than one-third of all Southeast and East 
Asian newt species are at risk of extinc- 
tion (2). Many of these are threatened by 
overharvesting for the international pet 
trade (6), as interest in keeping Asian newts, 
particularly in the West, continues to grow 
(7). Last year, Laotriton was promoted as a 
pet on the cover of a U.S.-based amphibian 
and reptile hobbyist magazine that has wide 
international readership (8). 

Martel et al.’s findings are foreboding for 
wild populations of European and North 
American salamanders. At the same time, 
wild Laotriton and other Asian newt popu- 
lations are in dire need of better protection 
from international trade. Perhaps Martel 
et al.’s findings provide a glimmer of hope 
that exporting and importing countries 
can simultaneously address these threats, 


albeit for different reasons. By 
stemming the trade of Asian 
newts to Europe and North 
America, risk of catastrophic 
extirpations of global salamander 
populations decreases: Asian 
newts remain in the wild, and 
naive European and North 
American salamanders are 
not exposed (at least further) to 
a pathogenic fungus that Martel 
et al. inferred has been restricted to Asia 
since the Paleogene. Source countries in 
Asia and destination countries in the West 
should work hand in hand to control the 
international trade of Asian newts. No 
Asian newt species are currently protected 
by the Convention on International Trade 
in Endangered Species of Wild Fauna and 
Flora (CITES), an international agreement 
that is intended to safeguard certain spe- 
cies from over-exploitation (9), but many 
probably should be. And importation of live 
Asian newts should be controlled in the 
same manner as livestock and poultry that 
pose disease threats to domestic popula- 
tions. Persistence of wild salamanders on 
both sides of the planet may depend on it. 
ONLINE BUZZ: HUMAN EVOLUTION 

Roots of self-domestication 

IN HER IN DEPTH NEWS Story “How we tamed ourselves— and 
became modern” (24 October, p. 405), A. Gibbons reported 
on a symposium about human self-domestication, a process 
that allowed increased cooperation and the development of 
cognitive skills such as complex language. Readers expressed 
interest in the topic in the online comments section. Excerpts 
from their responses are below. Read the full comments, 
and add your own, at http://comments.sciencemag.org/ 
content/10.1126/science.346.6208.405. 

A selection of your thoughts: 

...the idea that the process of killing sexually aggressive men 
was a driver [of self-domestication] seems flawed, in that many 
societies need to tolerate aggression as a protective factor, 
and dominance of one male over others has been a selection 
process for community leadership.... 

Philip Benjamin 

...One of the story’s main conclusions is that “selection 
favored males who could work together.”... In my opinion, 
advantage (in strength or in intelligence) over others..., not 


cooperation, is the feature favored by selection. Cooperation 
is one possible tool to achieve private aims (...for instance, 
politicians in democratic countries have to participate in 
elections [rather than making war as a way of attaining 
office]....). Maybe we are less aggressive (in a physical sense) 
in population centers but...our “self-domestication”...involves 
controlling the behaviors of other humans. 

Konrad Szocik 

...Research in this area should be heavily funded in order to 
help us to understand and mitigate the catastrophic deleterious 
effects of some...human communities [today].... 

Silvio Pitlik 

I think it may be important to consider the possibility that 
“selection for... facial feminization” can be rephrased as “selec- 
tion for retention of facial (and other) traits of the young of 
the species in general, whether female or male.” My reasoning 
is that retention of such general “childlike” traits (neotony), 
which engender stronger feelings and behaviors of tenderness, 
care, and bonding in adults of the species toward their children, 
would itself go a long way toward reducing intraspecies aggression 
and increasing the possibilities of cooperation, even far 
outside the usual family unit.... 

Gene Partlow 
China's ecological 
steps forward 

IN THEIR REPORT “A mid-term analysis of 
progress toward international biodiversity 
targets” (10 October, p. 241; published 
online 2 October), D. P. Tittensor and 


colleagues concluded that despite acceler- 
ating policy and management responses 
to the biodiversity crisis on a global scale, 
these efforts are unlikely to improve the 
state of biodiversity by 2020. They did 
not acknowledge that sustained national 
policies in China have already produced 
positive effects on biodiversity. 

The Chinese government initiated 
in 1999 the Natural Forest Resources 
Conservation Program and the Restoring 
Farmland into Forest Program. Logging 
has been prohibited in most natural 
forests, and cultivated land on areas with 
slopes of more than 25 degrees must be 
restored to forests or grasslands (1). The 
central government subsidized forest 
management and conservation, as well 
as seedling cultivation and reforestation 
(2, 3). Households that have returned 
their cultivated land to forests received 
subsidies from the central government (2, 
3). Recently, the State Council updated the 
policies and increased subsidies to further 
promote ecosystem restoration (2). 

A number of other key ecological pro- 
grams continue to be implemented, such 
as forest belt construction in the Yangtze 
River basin. More than US$80 billion
has been invested in these programs (1).
Ecological conditions have improved 
(2). Forest area, growing stock (the total 
stock volume of trees growing in land), 
and coverage rate (the percentage of area 
of afforested land compared with total 
land) all increased between 2009 and 
2013 (4-6). These programs have con- 
tributed to progress toward Aichi Targets 
5, 14, and 15 (2), although these policies 
need to be fine-tuned to best fit the local 
environment. 
As science
grows in China, 
ethics education 
aims to keep up 



Graduate students attend a public lecture on research ethics at the Great Hall of the People, in Beijing. 


By Brian Lin and Becky Ham 

A massive national effort in China
to improve scientific integrity has 
brought the issue to the attention of 
a staggering number of researchers 
in that country: In less than 4 years, 
more than 8 million undergradu- 
ate and graduate students have listened to 
20,000 lectures on the topic. 

But experts at a recent meeting held at 
AAAS said that ethics education should 
start even earlier for Chinese and American 
researchers and that such education faces 
stiff challenges from a research environ- 
ment that, in some instances, serves as a 
breeding ground for authorship disputes 
and misuse of data, among other problems. 

With international scientific collabora- 
tions on the rise, the issue of integrity is 
increasingly important, the meeting’s par- 
ticipants agreed. “The only way the scien- 
tific enterprise can contribute to worldwide 
problems is for... scientists to be willing, 
able, and enthusiastic about collaborating 
on a worldwide scale,” said AAAS CEO Alan 
I. Leshner. “Central to any collaboration are 
issues like trust, shared values, norms, and 
standards that drive the practice.” 

The China-U.S. Scientific Morality/Integ- 
rity Development Seminar was the fourth 
such meeting convened since the Scientists’ 
Social and Ethical Responsibilities confer- 
ence organized by the China Association 
for Science & Technology (CAST) and AAAS 
in 2007. Since then, CAST and AAAS have 
collaborated on several workshops on
scientific integrity, and in 2010 established a
joint steering committee to coordinate eth- 
ics work. 

Total research funding for Chinese univer- 
sities ballooned from US$7.6 billion in 2009 
to US$12.7 billion in 2012, said Guangxian 
Li, the executive vice president of Sichuan 
University, who spoke at the 8 to 9 October 
seminar on behalf of Shen Yan, vice chair- 
man of CAST and deputy director of the Na- 
tional Natural Science Foundation of China. 
Research papers published by Chinese uni- 
versities saw a 10% increase— to more than 
1.1 million— during the same period. 
The desire to achieve higher international
standings has put enormous pressures on 
Chinese researchers to publish, said Li. The 
QS World University Rankings and Times
Higher Education World University Rank- 
ings, for example, base 60 to 70% of their 
scores on research and citations. 

But a national Chinese survey of more 
than 30,000 Chinese researchers found 51% 
of respondents admitted to frequently or oc- 
casionally improperly attributing sources, 
while 42% submitted manuscripts to mul- 
tiple journals. Another survey of nearly 5500 
graduate students from 24 universities, con- 
ducted a year later, revealed a similar trend. 

An ambitious national campaign launched 
in 2011 by the Chinese Ministry of Education, 
Li said, is raising awareness of these issues. 
Six thousand graduate students from 60 uni- 
versities have attended annual lectures on re- 
search ethics, held at the iconic Great Hall of 
the People, in Beijing. But education should 
start even earlier in a student’s career, said 
Diange Yang, a professor in Tsinghua Univer- 
sity’s department of automotive engineering 
and deputy dean responsible for research 
work and international cooperation. 

“Strengthening academic integrity edu- 
cation of graduate students... should be the 
responsibility of the whole society, and it 
should be done from childhood,” said Yang. 

The United States is unlikely to hold na- 
tional ethics lectures as has happened in 
China, but AAAS has joined the National Sci- 
ence Foundation and the Office of Research 
Integrity in conducting ethics education for 
scientists, said Mark S. Frankel, director of 
the AAAS Scientific Responsibility, Human
Rights and Law Program. By the end of the
year, AAAS will post on its website seven case 
studies from a 2012 workshop that address 
issues of authorship, conflict of interest, 
collaboration, and plagiarism. And Frankel, 
Leshner, and Yang Wei, director of China’s 
National Natural Science Foundation, are co- 
authors of a chapter on Chinese and Ameri- 
can research integrity and ethics education 
that will be published in the “Handbook of 
Academic Integrity” in 2015. 

Though essential, education in responsible 
research conduct is not enough, said Philip 
Langlais, professor of psychology and former 
vice provost of graduate studies and research 
at Old Dominion University. A 2006 survey 
he conducted with 222 faculty and 534 grad- 
uate students in the United States found that 
while up to 90% of faculty reported provid- 
ing training in responsible research conduct, 
up to 35% of students claimed they received 
no training. Only 8% of students and faculty 
reported having discussed and reached con- 
sensus on topics such as authorship, owner- 
ship, and use of data. 

“We’ve been targeting individuals, but 
not other layers of influence,” Langlais 
said. “Scientists who have received proper 
training in responsible conduct of research 
have violated best practices and principles 
of scientific integrity as a result of a ‘toxic’ 
research environment” of higher workloads, 
shifting university priorities, and increasing 
competition for funding. 

In both China and the United States, “you 
can’t change climate and culture overnight,” 
he said. “But, you can make small, incre- 
mental changes.” ■ 
AAAS members elected as Fellows

In October 2014, the AAAS Council elected 401 members as Fellows of AAAS. These indi- 
viduals will be recognized for their contributions to science and technology at the Fellows 
Forum to be held on 14 February 2015 during the AAAS Annual Meeting in San Jose, California. 
Presented by section affiliation, they are: 


Section on Agriculture, Food, and 
Renewable Resources 

Andrew F. Bent, Univ. of Wisconsin-Madison
Douglas R. Call, Washington State Univ. 
Andrew D. Hanson, Univ. of Florida 
Daniel A. Herms, Ohio State Univ. 

Ann M. Hirsch, Univ. of California, Los 
Angeles 

Harold Corby Kistler, USDA Agricultural 
Research Service 
Ian H. Mather, Univ. of Maryland 
Bruce A. McPheron, Ohio State Univ. 

Charles Michael Smith, Kansas State Univ. 
Christopher K. Tuggle, Iowa State Univ. 

Diane E. Ullman, Univ. of California, Davis 
Esther van der Knaap, Ohio State Univ. 

Section on Anthropology 

David G. Anderson, Univ. of Tennessee, 
Knoxville 

Robert L. Anemone, Univ. of North Carolina 
at Greensboro 

Robert G. Franciscus, Univ. of Iowa 
Dale L. Hutchinson, Univ. of North Carolina 
at Chapel Hill 

Bertis Britt Little, Tarleton State Univ. 

Dwight W. Read, Univ. of California, Los 
Angeles 

Daniel H. Sandweiss, Univ. of Maine 
Pauline W. Wiessner, Univ. of Utah 

Section on Astronomy 

Tom Abel, Stanford Univ./Kavli Institute 
for Particle Astrophysics & Cosmology/ 
SLAC National Accelerator Laboratory 
Andreas J. Albrecht, Univ. of California, Davis 
Stanislav George Djorgovski, California 
Institute of Technology 
Martin S. Elvis, Harvard-Smithsonian Center 
for Astrophysics 

Paul G. Kalas, Univ. of California, Berkeley/ 
SETI Institute 

Margaret Meixner, Space Telescope Science 
Institute/Johns Hopkins Univ. 

Section on Atmospheric and 
Hydrospheric Sciences 

William D. Collins, Lawrence Berkeley 
National Laboratory/Univ. of California, 
Berkeley 

Kelvin K. Droegemeier, Univ. of Oklahoma 
Willard S. Moore, Univ. of South Carolina 
Drew T. Shindell, Duke Univ.

Frank J. Wentz, Remote Sensing Systems
Diane E. Wickland, NASA 

Section on Biological Sciences 

Alan A. Aderem, Seattle Biomedical 
Research Institute 


Russ B. Altman, Stanford Univ. 

Scott Allen Armstrong, Memorial Sloan 
Kettering Cancer Center 
Bernard P. Arulanandam, Univ. of Texas at 
San Antonio 

Alan D. Attie, Univ. of Wisconsin-Madison 
David Mansfield Bader, Vanderbilt Univ. 
School of Medicine 

Gregory F. Ball, Univ. of Maryland, College 
Park 

Marisa Susan Bartolomei, Univ. of Pennsyl- 
vania Perelman School of Medicine 
Steven A. Benner, Foundation for Applied 
Molecular Evolution 

Christoph Benning, Michigan State Univ. 

Aviv Bergman, Albert Einstein College of 
Medicine 

Jeffrey D. Blaustein, Univ. of Massachusetts 
Amherst 

Richard Merrill Breyer, Vanderbilt Univ. 

Craig Eugene Cameron, Pennsylvania State 
Univ. 

Lon R. Cardon, GlaxoSmithKline 
Douglas R. Cavener, Pennsylvania State 
Univ. 

Aravinda Chakravarti, Johns Hopkins Univ. 
School of Medicine 

Cheng-Ming Chuong, Univ. of Southern 
California 

Gino A. Cortopassi, Univ. of California, Davis 
Robert J. Cousins, Univ. of Florida
Richard D. Cummings, Emory Univ. School 
of Medicine 

Stephen P. Daiger, Univ. of Texas Health 
Science Center at Houston 
Ann Dean, National Institute of Diabetes and 
Digestive and Kidney Diseases/NIH 
Lynn Eleanor DeLisi, VA Boston Healthcare 
System/Harvard Medical School 
Ronald A. DePinho, Univ. of Texas MD 
Anderson Cancer Center 
Burton F. Dickey, Univ. of Texas MD 
Anderson Cancer Center 
Valerian V. Dolja, Oregon State Univ.

David C. Dorman, North Carolina State Univ. 
Gregory Roland Dressler, Univ. of Michigan
Daniela Drummond-Barbosa, Johns 
Hopkins Univ. 

Josee Dupuis, Boston Univ. School of Public 
Health 

Geoffrey M. Duyk, Texas Pacific Group (TPG) 
Irene Anne Eckstrand, National Institute of 
General Medical Sciences/NIH 
Sean R. Eddy, HHMI Janelia Research 
Campus 

Alan N. Engelman, Dana-Farber Cancer 
Institute 

Peter J. Espenshade, Johns Hopkins Univ. 
School of Medicine 


Rosann A. Farber, Univ. of North Carolina at 
Chapel Hill 

Joanna Floros, Pennsylvania State Univ.
College of Medicine
Jeffrey M. Friedman, Rockefeller Univ. 

Haian Fu, Emory Univ. School of Medicine 
David J. Garfinkel, Univ. of Georgia 
Deborah E. Goldberg, Univ. of Michigan 
Margaret A. Goodell, Baylor College of 
Medicine 

Dan Graur, Univ. of Houston 
Marilyn R. Gunner, City College of New York 
Marnie E. Halpern, Carnegie Institution for
Science 

T. Kendall Harden, Univ. of North Carolina at 
Chapel Hill 

A. Wallace Hayes, Harvard Univ. 

Chuan He, Univ. of Chicago 
John R. Hepler, Emory Univ. School of 
Medicine 

Oliver Hobert, Columbia Univ. Medical 
Center 

Stefan Hohmann, Univ. of Gothenburg 
(Sweden) 

Richard L. Hoover, Vanderbilt Univ. Medical 
Center 

Ya-Ming Hou, Thomas Jefferson Univ. 
Elizabeth Ehrhardt Howell, Univ. of 
Tennessee, Knoxville 
Trey Ideker, Univ. of California, San Diego 
W. Gray Jerome, Vanderbilt Univ. School of 
Medicine 

Teh-hui Kao, Pennsylvania State Univ. 

Jack H. Kaplan, Univ. of Illinois at Chicago 
Paul Andrew Karplus, Oregon State Univ. 
Tom Klaus William Kerppola, Univ. of 
Michigan 

Shohei Koide, Univ. of Chicago 
Stephen F. Konieczny, Purdue Univ. 

Bruce R. Korf, Univ. of Alabama at 
Birmingham 

Robert G. Kranz, Washington Univ. in St. 
Louis 

Lee Kroos, Michigan State Univ. 

David Landsman, National Center for 
Biotechnology Information/NIH 
David H. Ledbetter, Geisinger Health System 
Brendan Lee, Baylor College of Medicine 
Michael J. Leibowitz, Univ. of California,
Davis

Norman G. Lewis, Washington State Univ. 
Erwin London, Stony Brook Univ. 

Manyuan Long, Univ. of Chicago 
William L. Lowe Jr., Northwestern Univ.
Feinberg School of Medicine
Bruce J. MacFadden, Florida Museum of 
Natural History, Univ. of Florida 
Nicholas Gordon Martin, Queensland Insti- 
tute of Medical Research (Australia) 
Andrea M. Mastro, Pennsylvania State Univ. 

U. Thomas Meier, Albert Einstein College of 
Medicine 

Karen L. Mohlke, Univ. of North Carolina at
Chapel Hill 

Andrew W. Murray, Harvard Univ. 
Jeffrey C. Murray, Univ. of Iowa/Bill and
Melinda Gates Foundation 
David L. Nelson, Baylor College of Medicine 
Phillip Allan Newmark, Howard Hughes 
Medical Institute/Univ. of Illinois at 
Urbana-Champaign 

Ian C.T. Nisbet, I.C.T. Nisbet and Company 
(retired) 

Michael J. Oglesbee, Ohio State Univ. 

Mark D. Ohman, Scripps Institution of 
Oceanography/Univ. of California,
San Diego

Ian Michael Orme, Colorado State Univ. 

Mary Ann Ottinger, Univ. of Houston 
Tao Pan, Univ. of Chicago 
Patricia Ann Peyser, Univ. of Michigan 
School of Public Health 
Sara C. Pryor, Cornell Univ. 

B. Franklin Pugh, Pennsylvania State Univ. 
Richard Anthony Rachubinski, Univ. of 
Alberta (Canada) 

Danny F. Reinberg, New York Univ. 

Daniel Martin Roberts, Univ. of Tennessee, 
Knoxville 

Roy Martin (Marty) Roop II, East Carolina
Univ. School of Medicine 
Jocelyn Kenneth Campbell Rose, Cornell 
Univ. 

Barry Philip Rosen, Florida International 
Univ. Herbert Wertheim College of 
Medicine 

Stanley J. Roux, Univ. of Texas at Austin 
Peter A. Rubenstein, Univ. of Iowa 
Paolo Sassone-Corsi, Univ. of California, 
Irvine 

Todd A. Schlenke, Reed College 
Jon Seger, Univ. of Utah 
K. Krishna Sharma, Univ. of 
Missouri-Columbia 

Sanjay S. Shete, Univ. of Texas MD Anderson 
Cancer Center 

Robert H. Singer, Albert Einstein College 
of Medicine/HHMI Janelia Research 
Campus 

Michael Snyder, Stanford Univ. 

Benjamin C. Stark, Illinois Institute of 
Technology 

Rainer Storb, Fred Hutchinson Cancer
Research Center 
William S. Talbot, Stanford Univ. 

Stephen E. Ullrich, Univ. of Texas MD 
Anderson Cancer Center 
Karen M. Vasquez, Univ. of Texas at Austin 
Leslie B. Vosshall, Rockefeller Univ. 

Pamela J. Weathers, Worcester Polytechnic 
Institute 

Theodore G. Wensel, Baylor College of 
Medicine 

Mark Winey, Univ. of Colorado Boulder 
Edward W. Yu, Iowa State Univ. 

Section on Chemistry 

Nancy L. Allbritton, Univ. of North Carolina 
at Chapel Hill 

Jerry L. Atwood, Univ. of Missouri-Columbia 
Bill J. Baker, Univ. of South Florida 


Thomas Edwin Bitterwolf, Univ. of Idaho 
Bruce S. Brunschwig, California Institute of 
Technology 

Michelle V. Buchanan, Oak Ridge National
Laboratory 

Jef Karel De Brabander, Univ. of Texas 
Southwestern Medical Center at Dallas 
Henry C. Foley, Univ. of Missouri-Columbia
Heinz M. Frei, Lawrence Berkeley National 
Laboratory 

Michael T. Green, Pennsylvania State Univ. 
Brian A. Gregg, National Renewable Energy 
Laboratory 

Patrick L. Holland, Yale Univ. 

Timothy F. Jamison, Massachusetts Institute 
of Technology 

David M. Jonas, Univ. of Colorado Boulder 
Christopher W. Jones, Georgia Institute of 
Technology 

Silvia Sabine Jurisson, Univ. of 
Missouri-Columbia 

Christine Dolan Keating, Pennsylvania State 
Univ. 

John Z. Larese, Univ. of Tennessee, Knoxville 
Richard Alan LeSar, Iowa State Univ. 
Arumugam Manthiram, Univ. of Texas at 
Austin 

Thomas J. Meade, Northwestern Univ. 

Deane Fremont Mosher Jr., Univ. of 
Wisconsin-Madison 
Vincent T. Remcho, Oregon State Univ.

David Duncan Roberts, National Cancer 
Institute/NIH 

Laurel L. Schafer, Univ. of British Columbia 
(Canada) 

Charles Albert Schmuttenmaer, Yale Univ. 
Richard R. Schrock, Massachusetts Institute 
of Technology 

Yang Shao-Horn, Massachusetts Institute of 
Technology 

C. David Sherrill, Georgia Institute of 
Technology 

Mikiko Sodeoka, RIKEN (Japan) 

Mark T. Spitler, U.S. Department of Energy
George G. Stanley, Louisiana State Univ. 
Judith Stein, GE Global Research (Retired) 
JoAnne Stubbe, Massachusetts Institute of 
Technology 

Lai-Xi Wang, Univ. of Maryland 
Shi-Qing Wang, Univ. of Akron 
Bert Weckhuysen, Utrecht Univ. 
(Netherlands) 

David W. Wright, Vanderbilt Univ. 

Jin Zhang, Johns Hopkins Univ. School of 
Medicine 

Section on Dentistry and Oral Health 
Sciences 

Matthew J. Doyle, The Procter & Gamble 
Company 

Ophir David Klein, Univ. of California, San
Francisco 

Ann Progulske-Fox, Univ. of Florida 

Section on Education 

Cynthia Margaret Bauerle, Howard Hughes 
Medical Institute 


Bianca L. Bernstein, Arizona State Univ. 
Elizabeth S. Boylan, Alfred P. Sloan 
Foundation

Elaine A. Johnson, Bio-Link/City College of 
San Francisco 

David E. Lopatto, Grinnell College 
Patricia A. Marsteller, Emory Univ. 

David Hillyer Voorhees, Waubonsee 
Community College 

Daniel A. Wubah, Washington and Lee Univ. 

Section on Engineering 

John S. Baras, Univ. of Maryland 
Edward J. Berbari, Indiana Univ. Purdue Univ. 
Indianapolis 

Roger T. Bonnecaze, Univ. of Texas at Austin 
Lance R. Collins, Cornell Univ. 

Placid M. Ferreira, Univ. of Illinois at
Urbana-Champaign 

Shekhar Garde, Rensselaer Polytechnic 
Institute 

Brendan A. Harley, Univ. of Illinois at 
Urbana-Champaign 
Gilbert V. Herrera, Sandia National 
Laboratories 

Mark C. Hersam, Northwestern Univ. 

Yun Hang Hu, Michigan Technological Univ. 
David A. Kofke, Univ. at Buffalo, SUNY 
K. Lu, Chinese Academy of Sciences 
Joseph William Lyding, Univ. of Illinois at 
Urbana-Champaign 
Manos Mavrikakis, Univ. of 
Wisconsin-Madison 
Karen Anne Moxon, Drexel Univ. 

Debbie A. Niemeier, Univ. of California, Davis 
Christopher Kemper Ober, Cornell Univ.
Vilupanur A. Ravi, Cal Poly Pomona 
Clinton T. Rubin, Stony Brook Univ. 

Ka-Yiu San, Rice Univ. 

Wolfgang M. Sigmund, Univ. of Florida 
Igal Szleifer, Northwestern Univ. 

George Alexander Truskey, Duke Univ. 
Marjolein Christine H. van der Meulen, 
Cornell Univ. 

Gordana Vunjak-Novakovic, Columbia Univ. 
Norman J. Wagner, Univ. of Delaware 
Ge Wang, Rensselaer Polytechnic Institute 
Robert K. Whitman, Univ. of Denver 
Jessica O. Winter, Ohio State Univ.
Mohammed A. Zikry, North Carolina State 
Univ. 

Section on General Interest in Science 
and Engineering 

John Katzenberger, Aspen Global Change 
Institute 

Mary K. Miller, Exploratorium, San Francisco 
Peter N. Spotts, The Christian Science 
Monitor 

Section on Geology and Geography 

Rodey Batiza, National Science Foundation 
Hilary H. Birks, Univ. of Bergen (Norway) 
Steven C. Cande, Univ. of California, San 
Diego 

Kenneth H. Coale, Moss Landing Marine 
Laboratories 
Thomas Cronin, U.S. Geological Survey
Rane L. Curl, Univ. of Michigan 
Samantha B. Joye, Univ. of Georgia 
Michael Keller, USDA Forest Service 
Kenneth P. Kodama, Lehigh Univ.

Daniel A. Lashof, Natural Resources Defense 
Council 

David Lopez-Carr, Univ. of California, Santa 
Barbara 

Derek R. Lovley, Univ. of Massachusetts 
Amherst 

Melanie A. Mayes, Oak Ridge National 
Laboratory 

Nora Noffke, Old Dominion Univ. 

Harry H. Roberts, Louisiana State Univ. 
George D. Stanley Jr., Univ. of Montana 
Brian William Stump, Southern Methodist 
Univ. 

Thomas William Swetnam, Univ. of Arizona 
James W. C. White, Univ. of Colorado Boulder 
Carol M. Wicks, Louisiana State Univ. 

Section on History and Philosophy of 
Science 

Cathryn Carson, Univ. of California, Berkeley 
Robert J. Malone, History of Science 
Society/Univ. of Notre Dame 
Joseph C. Pitt, Virginia Tech 
C. Kenneth Waters, Univ. of Minnesota 

Section on Industrial Science and 
Technology 

Cammy R. Abernathy, Univ. of Florida 
Liyuan Liang, Oak Ridge National Laboratory 
J. Stephen Rottler, Sandia National 
Laboratories 

Justin Schwartz, North Carolina State Univ. 

Section on Information, Computing, 
and Communication 

Francine Berman, Rensselaer Polytechnic 
Institute 

Ahmed K. Elmagarmid, Qatar Computing 
Research Institute, Qatar Foundation 
Thomas D. Garvey, SRI International 
Peter D. Karp, SRI International 
Madhav V. Marathe, Virginia Tech 
Eric Mjolsness, Univ. of California, Irvine 
Dan Roth, Univ. of Illinois at 
Urbana-Champaign 
William H. Sanders, Univ. of Illinois at 
Urbana-Champaign 
Thomas L. Sterling, Indiana Univ. 

Section on Linguistics and Language 
Sciences 

Ralph W. Fasold, Georgetown Univ. 

Lise Menn, Univ. of Colorado Boulder 
Pamela Munro, Univ. of California, Los 
Angeles 

Section on Mathematics 

James M. Crowley, Society for Industrial and 
Applied Mathematics 
Charles L. Epstein, Univ. of Pennsylvania 
Natasa Jonoska, Univ. of South Florida
Kirk E. Jordan, IBM Research Division 


Yuri Tschinkel, New York Univ. 

Howard (Howie) Weiss, Georgia Institute of 
Technology 

Section on Medical Sciences 

Lynne V. Abruzzo, Ohio State Univ.

Naji N. Abumrad, Vanderbilt Univ. School of 
Medicine 

Rexford S. Ahima, Univ. of Pennsylvania 
Perelman School of Medicine 
Andrew Arnold, Univ. of Connecticut School 
of Medicine 

Linda Gwen Baum, Univ. of California, Los 
Angeles Geffen School of Medicine 
Jeremy M. Boss, Emory Univ. School of 
Medicine 

Myles A. Brown, Dana-Farber Cancer 
Institute 

David Michael Center, Boston Univ. Medical 
Center 

Jeffrey S. Chamberlain, Univ. of Washington 
School of Medicine 

Bandana Chatterjee, Univ. of Texas Health 
Science Center at San Antonio 
Xinbin Chen, Univ. of California, Davis 
Jeffrey I. Cohen, National Institute of Allergy 
and Infectious Diseases/NIH 
Ronald B. Corley, Boston Univ. School of 
Medicine 

Christopher M. Counter, Duke Univ. Medical 
Center 

Jules L. Dienstag, Harvard Medical School 
Mary C. Dinauer, Washington Univ. School of 
Medicine in St. Louis 

Jeffrey M. Drazen, New England Journal of 
Medicine 

Garth D. Ehrlich, Drexel Univ. College of 
Medicine 

Agnes B. Fogo, Vanderbilt Univ. Medical 
Center 

Scott Laurence Friedman, Icahn School of 
Medicine at Mount Sinai 
Sanjiv Sam Gambhir, Stanford Univ. 
Katherine Amberson Hajjar, Weill Cornell 
Medical College 

Barbara C. Hansen, Univ. of South Florida 
Christopher Charles William Hughes, Univ. 
of California, Irvine 

Rakesh K. Jain, Massachusetts General 
Hospital/Harvard Medical School 
Robert Gordon Kalb, Children's Hospital of 
Philadelphia/Univ. of Pennsylvania Perel- 
man School of Medicine 
Fadlo Raja Khuri, Winship Cancer Institute,
Emory Univ. 

Margaret Kielian, Albert Einstein College of 
Medicine 

Richard N. Kitsis, Albert Einstein College of 
Medicine 

Carol Ann Kumamoto, Tufts Univ. 

Stephen B. Liggett, Univ. of South Florida 
College of Medicine 
Jay S. Loeffler, Massachusetts General 
Hospital 

Joseph Loscalzo, Brigham and Women’s
Hospital/Harvard Medical School 


Simon Alexander Mallal, Vanderbilt Univ.
School of Medicine
John Mendelsohn, Univ. of Texas MD 
Anderson Cancer Center 
Dennis W. Metzger, Albany Medical College 
Jeffrey J. Molldrem, Univ. of Texas MD 
Anderson Cancer Center 
David R. Piwnica-Worms, Univ. of Texas MD 
Anderson Cancer Center 
Fernando P. Polack, Vanderbilt Univ. School 
of Medicine 

Katya Ravid, Boston Univ. School of 
Medicine 

Barrett Jon Rollins, Dana-Farber Cancer 
Institute 

Nancy Hartman Ruddle, Yale Univ. School of 
Public Health 

David J. Salant, Boston Univ. Medical Center 
Erica Ollmann Saphire, Scripps Research 
Institute 

P. Sarita Soni, Indiana Univ. 

David Samuel Stephens, Emory Univ. 
Woodruff Health Sciences Center School 
of Medicine 

Patrick J. Stover, Cornell Univ. 

Bruce Alan Sullenger, Duke Univ. Medical 
Center 

James Ward Thomas II, Vanderbilt Univ.
School of Medicine
Kevin J. Tracey, Feinstein Institute for
Medical Research 

Jan Vijg, Albert Einstein College of Medicine 
Paul A. Welling, Univ. of Maryland School of 
Medicine 

Section on Neuroscience 

Richard Warren Aldrich, Univ. of Texas at 
Austin 

Silvia Arber, Univ. of Basel/Friedrich 
Miescher Institute for Biomedical 
Research (Switzerland) 

Rita Balice-Gordon, Pfizer, Inc. 

Francisco Bezanilla, Univ. of Chicago 
Samuel A. Deadwyler, Wake Forest Univ.
School of Medicine
Daniel J. Goldman, Univ. of Michigan 
David M. Holtzman, Washington Univ. in 
St. Louis 

Christof Koch, Allen Institute for Brain 
Science 

Alex Leo Kolodkin, Johns Hopkins Univ. 
School of Medicine 

Michael Stephen Levine, Univ. of California, 
Los Angeles 

David G. Morgan, Univ. of South Florida 
Ronald William Oppenheim, Wake Forest
Univ. School of Medicine 
Marina R. Picciotto, Yale Univ. School of 
Medicine 

Jeffrey D. Schall, Vanderbilt Univ. 

Masatoshi Takeichi, RIKEN (Japan) 

Stephen Francis Traynelis, Emory Univ.
School of Medicine
Fan Wang, Duke Univ. Medical Center
George D. Yancopoulos, Regeneron 
Pharmaceuticals, Inc. 

Charles Zuker, Columbia Univ. 
Section on Pharmaceutical Sciences

Suresh V. Ambudkar, National Cancer 
Institute/NIH 

Douglas A. Bayliss, Univ. of Virginia 
Peter Buchwald, Univ. of Miami School of 
Medicine 

Alice M. Clark, Univ. of Mississippi 
Varsha Gandhi, Univ. of Texas MD Anderson 
Cancer Center 

Randy A. Hall, Emory Univ. School of 
Medicine 

Stephen S. Hecht, Univ. of Minnesota 
Medical School 

Sung Wan Kim, Univ. of Utah College of 
Pharmacy 

Gary R. Matzke, Virginia Commonwealth 
Univ. School of Pharmacy 

Section on Physics 

Andrew J. Baker, Rutgers, The State Univ. of 
New Jersey 

Robert Allen Bartynski, Rutgers, The State 
Univ. of New Jersey 

Philip G. Collins, Univ. of California, Irvine 
Priscilla B. Cushman, Univ. of Minnesota 
Carlos Henrique de Brito Cruz, Sao Paulo 
Research Foundation (FAPESP) (Brazil) 
Alan T. Dorsey, Univ. of Georgia 
John Harte, Univ. of California, Berkeley 
Mary Y.P. Hockaday, Los Alamos National
Laboratory 

David Larbalestier, National High Magnetic 
Field Laboratory, Florida State Univ. 


Marvin L. Marshak, Univ. of Minnesota 

Roberto Daniel Merlin, Univ. of Michigan 

Mark Newman, Univ. of Michigan 

Harrison B. Prosper, Florida State Univ. 

Bruce Arne Sherwood, North Carolina State 
Univ. 

Milton Dean Slaughter, Florida International 
Univ. 

Christopher Michael Sorensen, Kansas 
State Univ. 

Nancy L. Thompson, Univ. of North Carolina 
at Chapel Hill 

Amir Yacoby, Harvard Univ.

Section on Psychology 

John T. Bruer, James S. McDonnell 
Foundation 

Robert S. Feldman, Univ. of Massachusetts, 
Amherst 

Wilma Koutstaal, Univ. of Minnesota 

Stanley Abraham Kuczaj II, Univ. of Southern
Mississippi 

Nan Bernstein Ratner, Univ. of Maryland, 
College Park 

Eric Wanner, Russell Sage Foundation 

Section on Social, Economic, and
Political Sciences

Julia Ingrid Lane, American Institutes for 
Research/Univ. of Strasbourg (France)/ 
Univ. of Melbourne (Australia) 

Dudley L. Poston Jr., Texas A&M Univ. 


Barbara Schneider, Michigan State Univ. 
Mitchel B. Wallerstein, Baruch College 
Michael Joseph White, Brown Univ. 

Section on Societal Impacts of Science 
and Engineering 

Elizabeth A. Chornesky, Independent 
Analyst 

T. Taylor Eighmy, Univ. of Tennessee, 
Knoxville 

Anthony Fainberg, Institute for Defense 
Analyses 

Micah Daniel Lowenthal, National Academy 
of Sciences 

Section on Statistics 

Susmita Datta, Univ. of Louisville 
Mark Andrew Espeland, Wake Forest Univ. 
School of Medicine 

Robert Jackson Hardy, Univ. of Texas School 
of Public Health 

William Q. Meeker Jr., Iowa State Univ. 

Paula Karen Roberson, Univ. of Arkansas for 
Medical Sciences 
John P. Sall, SAS Institute
James J. Schlesselman, Univ. of Pittsburgh 
Stephanie Shipp, Virginia Tech 
Clifford Spiegelman, Texas A&M Univ. 


AAAS Kavli Science Journalism 
Award winners named 


By Earl Lane 

Stories exploring human biology, including
our interactions with the trillions of
microbes we all harbor, the
influences of our fishy evolutionary 
forebears on how we look, and the 
enduring challenge of understanding 
cancer, are among the winners of the 2014 
AAAS Kavli Science Journalism Awards. 

Large Newspaper— Circulation of 
100,000 or more: George Johnson, The 
New York Times, for “Why Everyone Seems 
to Have Cancer,” 5 January 2014; “A Tumor, 
the Embryo’s Evil Twin,” 18 March 2014; 
and “An Apple a Day, and Other Myths,” 22 
April 2014. 

Small Newspaper— Circulation less 
than 100,000: Matthew LaPlante and 
Paul Christiansen, Salt Lake City Weekly, 
for “Devastated: The World’s Largest 
Organism is in Utah— and It’s Dying,” 
21 November 2013. 


Magazine: David Dobbs, Pacific
Standard, for “The Social Life of Genes,”
September/October 2013. 

Television— Spot News/Feature Re- 
porting (20 minutes or less): Michael 
Werner, KCTS 9/QUEST, for “The Ecology 
of Fear,” 6 March 2014. 

Television— In Depth Reporting (more 
than 20 minutes): Michael Rosenfeld, Da- 
vid Dugan, and Neil Shubin, Tangled Bank 
Studios/Windfall Films for PBS, for “Your 
Inner Fish,” 9 April, 16 April, & 23 April 2014. 

Radio: Rob Stein, NPR, for “Staying 
Healthy May Mean Learning To Love Our 
Microbiomes,” 22 July 2013; “From Birth, 
Our Microbes Become As Personal As A 
Fingerprint,” 9 September 2013; and 
“Getting Your Microbes Analyzed Raises Big 
Privacy Issues,” 4 November 2013. 

Online: Amy Dockser Marcus, The Wall 
Street Journal, for “Trials: A Desperate 
Fight to Save Kids and Change Science,” 14 
November 2013. 



Children’s Science News: Mara Grunbaum,
Scholastic Science World, for “Biting
Back,” 16 September 2013; “Underwater Ad- 
venturer,” 7 October 2013; and “Swallowed 
Up,” 3 February 2014. 

The awards, administered by AAAS since 
their inception in 1945, go to professional 
journalists for distinguished reporting for 
a general audience. The Kavli Foundation 
provided a generous endowment in 2009 
that ensures the future of the program. 

Independent panels of science journalists 
pick the winners, who will receive $3000 
and a plaque at the 2015 AAAS Annual 
Meeting in San Jose, California, in February. 
Learn more at www.aaas.org/sja2014. ■ 
Remnant magnetization
in a primitive meteorite 


Fu et al., p. 1089 
Wild cucumber


Biosynthetic pathway holds
roots of domestication 


The wild cucumber is a spiky, bitter
relative of what we now grow in 
our gardens. The bitterness comes 
from cucurbitacin, which helps 
the plant to ward off herbivores. 
Cucurbitacin is also useful to people 
for its anti-tumor properties. Shang et al.
have now worked out the biosynthetic
pathway of cucurbitacin. Along the way, 
they discovered genetic traces of the 
domestication process and unraveled the 
mystery of why some cucumbers, if grown 
in chilly conditions, become bitter. — PJH 
Science, this issue p. 1084 


PLANT SCIENCE 


BLACK HOLE PHYSICS 

Timing tells the structure 
of the unseen 

Nothing can move faster than 
the speed of light, but some 
gamma rays seem to break that 
rule. Powerful jets that balance 
the angular momentum of 
accreting black holes are 
difficult to discern in images, 
so astronomers often resort to 
timing their emission to reveal 
the physics at work. Aleksic et al.
found that gamma rays from
the active galaxy IC 310 varied
faster than the time required 
for light to cross the event 
horizon of the supermassive 
black hole at its nucleus. Particle 
acceleration at the base of the 
jet may enable this apparent 
speed, adding a piece to 
the puzzle of how jets form at 
supermassive black holes. 

- MMM 

Science, this issue p. 1080 


MATERIALS SCIENCE 

Graphene: A miniature 
bulletproof vest? 

To stop a speeding projectile, you 
need a combination of strength 
and toughness so that the impact 
doesn’t just pierce the stopping 
material. The material also needs 
to dissipate the absorbed kinetic 
energy. Lee et al. measured the
response of multilayer graphene 
to the projection of microbullets 
using miniaturized ballistic tests. 
The findings confirm graphene's 
exceptional strength and stiff- 
ness. — MSL 

Science, this issue p. 1092 

CHEMICAL OCEANOGRAPHY 

Polluting the way 
to more productivity 

Most biologically available nitro- 
gen comes from the recycling 
of organic matter and nitrogen 


fixation. However, airborne 
anthropogenic nitrogen— air 
pollution— can also provide a 
source of such nitrogen. Kim
et al. reconstructed changes in
the N content of surface water 
across the North Pacific Ocean 
for the past four decades. N 
concentrations have increased 
markedly. This trend could 
enhance microbial growth in the 
ocean and eventually increase 
production of the greenhouse 
gas N₂O. - HJS

Science, this issue p. 1102 


COGNITIVE PSYCHOLOGY 

Forgetting history one 
president at a time 

Memory for past events and 
famous people, some of which 
may have happened or lived 
before we were born, fades 
with time. Roediger and DeSoto 
measured the extent and rate 


at which former U.S. presidents
have been forgotten (see the 
Perspective by Rubin). Three 
cohorts of college students, 
spanning four decades, forgot 
in the same way: Each group 
remembered Washington and 
Lincoln and also the most recent 
office holders (Ford, Reagan, 
and Obama). — GJC 

Science, this issue p. 1106;

see also p. 1058 


SKIN DISEASE 

Personalized cell therapy 
for skin disorder 

Mutations in the COL7A1 gene 
cause severely fragile skin 
and painful blistering in a rare 
disease, recessive dystrophic 
epidermolysis bullosa (RDEB). 
There is no cure, but Sebastiano
et al. now show that cell therapy
could be a viable option. The 
authors took skin biopsies from 
three patients with RDEB and
generated induced pluripotent 
stem cells, or iPSCs, from the 
tissue. They genetically cor- 
rected the COL7A1 mutation 
and made keratinocytes (skin 
cells) that expressed normal 
collagen VII. In mice, these
“repaired” keratinocytes formed
layers of human skin expressing 
collagen VII. The skin grafts only 
lasted for a few weeks, but the 
ability to genetically correct and 
bank a patient's own cells is an 
important step toward treating 
RDEB. -MLF 

Sci. Transl. Med. 6, 264ra163 (2014).


PALEOGENOMICS 

Secrets of human ancestor 
evolution revealed 

Studies of ancient humans help 
us understand the movement 
and evolution of modern popula- 
tions of humans. Seguin-Orlando 
et al. present the genome of an
ancient individual, K14, from 
northern Russia who lived over 
36,000 years ago. K14 is more 
similar to west Eurasians and 
Europeans than to east Asians, 
indicating that these populations 
had already diverged. — LMZ 

Science, this issue p. 1113 


REPELLENT SURFACES 

Undercutting the surface 
keeps liquids at bay 

The shape of an umbrella is 
designed both to protect the 
holder from falling droplets 
and to have the collected 
rainwater flow away from the 





person underneath. Liu and Kim 
exploited the idea of an umbrella 
to make materials with a surface 
that repels almost any liquid. 

- MSL 

Science, this issue p. 1096 

MITOCHONDRIA 

Cross talk between 
mitochondria and mitosis 

Mitochondria, the power houses 
of the cell, contain their own 
genome. Nevertheless, the 
majority of their constituent 
proteins are encoded by nuclear 
genes and are translated in the 
cytosol. During the cell cycle, a 
cell needs to duplicate each of its 
constituent parts and organelles. 
Surprisingly, it has never been 
clear how or whether mitochon- 
drial protein import is linked with 
the cell cycle. Harbauer et al. (see 
the Perspective by Schulz and 
Rehling) now show that one of 
the main conduits for mitochon- 
drial protein import is directly 
regulated by phosphorylation 
during mitosis, and that this 
in turn promotes respiratory 
activity. — SMH 

Science, this issue p. 1109


CANCER IMMUNOLOGY 

Retaining a barrier to 
cancer metastasis 

One way cancers can spread, or 
metastasize, is by invading blood 
vessels, which are lined with 
endothelial cells. Pignatelli et al.
found that cells from biopsies of 
invasive breast cancers crossed 
a layer of endothelial cells 
more readily than those from 
less aggressive breast cancers, 
particularly when cultured 
with macrophages (see the 
Perspective by Kiersse et al.). 
Macrophages promoted breast 
cancer cell invasion by secret- 
ing the growth factor EGF, which 
induced breast cancer cells to 
secrete the cytokine CSF-1. The 
cytokine acted on macrophages 
and on some types of cancer 
cell. Blocking the binding of 
CSF-1 to its receptor prevented 
cancer cells from crossing endo- 
thelial cultures. — LKF 

Sci. Signal. 7, ra112 and pe28 (2014).


IN OTHER JOURNALS 


Edited by Kristen Mueller 
and Jesse Smith 


Common ravens
(Corvus corax) engaged
in a mating behavior.

ANIMAL BEHAVIOR

Up and down the raven social ladder 

Social hierarchies are complex, and threats abound for
people and, as it turns out, for ravens, too. Massen
et al. followed a group of approximately 200 individually
marked ravens in the Austrian Alps. They found
that strongly bonded breeding pairs at the top of the 
raven social hierarchy disrupted interactions between loosely 
bonded pairs lower in the hierarchy. They also tended to 
ignore nonbonded individuals located at the bottom of the 
hierarchy. In doing so, the higher-ranking birds may prevent 
the lower-status breeding pairs from rising up the hierarchy 
and becoming a social threat. — GR 

Curr. Biol. 24 , 2733 (2014). 
VERTEBRATE EVOLUTION


Turtles’ breathing evolved at a turtle’s pace 


strongly influenced by ~200-year-long
solar cycles. — NW

Proc. Natl. Acad. Sci. U.S.A. 10.1073/
pnas.1405237111 (2014).


GENE REGULATION 

Insulin secretion 
organized in the nucleus 

Cells can modify gene expres- 
sion by changing the spatial and 
physical landscape of DNA in 
the nucleus. This allows a gene 
to influence the expression of
another gene that may be located 
several megabases down the 
chromosome. With this in mind 
and to find new genes that may 
influence how insulin functions, 

Xu et al. looked for genes that 
physically contacted the insulin 
gene in human pancreatic beta 
cells, cells that make and store 
insulin. They found that the
insulin promoter contacted and
regulated the ANO1 gene, located
more than half a chromosome
away, which encodes a calcium-activated
chloride channel.
Glucose increased ANO1 expression,
and loss of ANO1 inhibited
insulin secretion. — LBR

Proc. Natl. Acad. Sci. U.S.A. 10.1073/
pnas.1419240111 (2014).


In most vertebrates, muscles associated with the lungs (intercostal muscles) work with the
abdominal muscles to control breathing. In turtles, however, the rigid shell immobilizes the inter- 
costal muscles, so only the abdominal muscles control lung expansion. How did such a major shift 
in an essential physiological function evolve? To find out, Lyson et al. compared extinct and extant
vertebrates. The authors found that this transition proceeded slowly and began with a broadening 
of the ribs in a species that existed 50 million years before the first fully shelled turtle. — SNV 

Nat. Commun. 10.1038/ncomms6211 (2014). 


NEUROSCIENCE 

Virtual reality changes 
neuronal firing 

Place cells are a type of neuron in 
the brain that gives you a sense 
of place and allows for spatial 
navigation. Visual cues are impor- 
tant for determining place cell 
function, but do other sensory or 
motor cues also play a role? To 
find out, Aghajan et al. compared
rats navigating mazes in the real 
world and rats doing the same 
thing in virtual reality, where the 
only input they received was from 
distal visual cues. By monitor- 
ing the firing of place cells, the 


authors found that these cells 
did not perform optimally in the 
rats experiencing the virtual 
reality mazes. These results sug- 
gest that place cells need more 
than just visual signals. — PRS 

Nat. Neurosci. 10.1038/ 
nn.3884 (2014).


ORGANIC GEOCHEMISTRY 

Looking at lipids 
with lasers 

The remnants of microbial 
lipids preserved in sediments 
provide reliable records of 
past aquatic environments. 


Extracting trace amounts of 
these molecules from sedi- 
ment cores is often tricky and 
requires a large amount of 
material, which causes the time 
resolution of these records 
to be rather coarse. Wörmer
et al. used a high-resolution
mass spectrometer coupled 
to a laser ionization source to 
track variations of nanogram 
levels of archaeal tetraethers 
at a resolution of ~4 years. Tiny
variations in lipid structure and 
abundance in sediments from 
the Mediterranean Sea with ages 
of over 10,000 years show that 
planktonic archaea there were 


MICROFLUIDICS 

Sorting cells by 
subpopulations 

The loss of cells from a tumor into 
the bloodstream creates circulat- 
ing tumor cells (CTCs) that may 
allow a cancer to metastasize. 

Within this population of CTCs, a 
heterogeneous range of cells may 
exist. Mohamadi et al. have built
a microfluidic device that can col- 
lect CTCs and differentiate them 
into distinct groups. Separation is 
based on the quantity of the epi- 
thelial marker EpCAM found on 
the surface, which may be related 
to the metastatic potential of the 
CTC. Magnetic nanoparticles that
target EpCAM may cling to the
cell surface, allowing the particles
to be separated in controlled
flow zones based on their ability
to resist the fluid flow due to
magnetic attraction. — MSL

Angew. Chem. 10.1002/
ange.201409376 (2014).
MINERALOGY

A mineral name for 
mantle perovskite 

A rock from outer space finally 
puts a name to Earth’s most 
abundant mineral, frequently 
referred to as perovskite. Mineral 
names are only bestowed on 
specimens that are found in 
nature and characterized. 
Tschauner et al. isolate a magnesium
silicate in the perovskite
structure, now called bridgman- 
ite, in the Tenham L6 chondrite 
meteorite (see the Perspective 
by Sharp). Bridgmanite formed 
in this meteorite during a high- 
pressure and -temperature 
shock event. Other minerals 
associated with bridgmanite 
allow the pressure-temperature 
conditions to be narrowly bound,
giving insight into the shock
process. The long-sought-after 
specimen finally puts to rest 
a confusing nomenclature of 
this dense deep mantle silicate. 
-BG 

Science, this issue p. 1100; 

see also p. 1057 


PALEOMAGNETISM 

Magnetic moments in
planetary history 

To know the magnetic history 
of the solar nebula in the age 
of planet formation, research- 
ers turn to the most primitive 
meteorites. Samples such as 
the Semarkona chondrite are 
composed partly of chondrules, 
which reflect the strength of the 
ambient magnetic field when 


this material was last molten. 

Fu et al. used a SQUID microscope
to measure the remnant
magnetization in a section of 
Semarkona. The findings reveal 
secrets about what goes on 
inside protoplanetary disks. 

- MMM 

Science, this issue p. 1089 


GENOME ENGINEERING 

CRISPR-Cas: A revolution
in genome engineering

The ability to engineer genomic 
DNA in cells and organisms easily 
and precisely will have major 
implications for basic biology 
research, medicine, and biotech- 
nology. Doudna and Charpentier 
review the history of genome 
editing technologies, including
oligonucleotides coupled to genome
cleaving agents that rely on 
endogenous repair and recombi- 
nation systems to complete the 
targeted changes, self-splicing 
introns, and zinc-finger nucleases 
and TAL effector nucleases. They 
then describe how clustered 
regularly interspaced palindromic 
repeats (CRISPRs), and their 
associated (Cas) nucleases, were 
discovered to constitute an adap- 
tive immune system in bacteria. 
They document development of 
the CRISPR-Cas system into a 
facile genome engineering tool 
that is revolutionizing all areas of 
molecular biology. — GR 

Science, this issue p. 1077 


TETANUS TOXIN ENTRY 

A potential peptide to 
prevent tetanus? 

Tetanus (TeNT) and botulinum 
(BoNT) neurotoxins represent 
a family of powerful bacterial 
protein toxins that cause tetanus 
and botulism in humans and 
animals. The molecular mecha- 
nisms responsible for the entry 
and axonal retrograde transport 
of these toxins have been the sub- 
ject of intense research. However, 
tetanus and botulism remain 
incurable, at least in part because 


of their high-affinity binding to 
synapses. Although the recep- 
tors for BoNT have recently been 
characterized at the molecular 
level, no protein receptor for TeNT 
at the neuromuscular junction 
has been identified. Bercsenyi et
al. now suggest that TeNT exploits
nidogen-1 and -2 for its binding to 
motor neurons. This binding is 
required for TeNT’s internalization 
and axonal retrograde trans- 
port. Nidogens are extracellular 
matrix proteins that engage in 
multiple protein-protein interac- 
tions essential for the integrity 
of several tissues, including the 
nervous system. Interfering with 
the interaction between nidogens 
and TeNT by administering short 
nidogen-derived peptides blocked 
toxin binding to the neuromuscu- 
lar junction and protected mice 
from tetanus. — SMH 

Science, this issue p. 1118 


T CELL SIGNALING 

Stimulatory signals add 
up for T cells 

T cell activation is a dynamic 
process. T cells encounter 
multiple input signals such 
as antigens, costimulatory 
molecules, and cytokines at 
different times and anatomical 
locations during an infection. 

But how do T cells integrate this 
information to determine the 
extent to which they divide? To 
find out, Marchingo et al. stimulated
mouse T cells in culture
with different combinations of 
inputs and also tracked antigen- 
specific T cell responses in mice 
infected with influenza virus. 
They found that T cells linearly 
sum the various stimulatory 
inputs they receive to determine 
their "division destiny." — KLM 

Science, this issue p. 1123 


FUNGAL BIOGEOGRAPHY 

Assessing fungal 
diversity worldwide 

Fungi are hyperdiverse but poorly 
known, despite their ecological 
and economic impacts. Tedersoo
et al. collected nearly 15,000
topsoil samples from 365 sites 
worldwide and sequenced their 
genomes (see the Perspective 
by Wardle and Lindahl). Overall, 
they found a striking decline 
in fungal species richness with 
distance from the equator. For 
some specialist groups though, 
diversity depended more on the 
abundance of host plants than 
host diversity or geography. 

The findings reveal a huge gap 
between known and described 
species and the actual numbers 
of distinct fungi in the world’s 
soils. — CA 

Science, this issue p. 1078; 

see also p. 1052


DNA REPAIR 

Staggered incisions 
unhook DNA crosslinks 

Mutations that covalently link 
the two strands of DNA together,
known as interstrand cross-links 
(ICLs), can wreak havoc with 
the many processes that need 
to separate the DNA strands to 
read the genome. For example, 
ICLs are commonly generated 
in the skin by exposure to UV. 
Wang et al. determined the 
crystal structure of the ICL 
repair exonuclease FAN1 bound
to a branched segment of DNA 
that mimics an intermediate 
on the ICL repair pathway. The 
enzyme anchors itself at a cut 


end of DNA and then cleaves 
successively at three nucleo- 
tide intervals, a pattern ideal 
for unhooking the ICL. FAN1
seems to have evolved to act, at 
least in part, at sites coupled to 
DNA synthesis or homologous 
recombination. — GR 

Science, this issue p. 1127 


SYSTEMS BIOLOGY 

Deciphering information 
flow in T cells 

We can now measure the 
activation state of multiple 
components of biochemical 
signaling pathways in single 
cells. This ability reveals how 


information flows through such 
cellular regulatory pathways 
and how it is altered in disease. 
Krishnaswamy et al. applied sta- 
tistical techniques to overcome 
the complexity and variation (or 
noise) in such single-cell mea- 
surements. They used these 
techniques to quantify informa- 
tion transfer between proteins 
that participate in antigen 
recognition in cells of the 
immune system. The methods 
should prove useful in analysis 
of other signaling circuits to 
enhance basic understanding 
and reveal potential therapeutic 
targets to fight disease. 

-LBR 

Science, this issue p. 1079 
REVIEW SUMMARY


GENOME EDITING 

The new frontier of genome 
engineering with CRISPR-Cas9 

Jennifer A. Doudna* and Emmanuelle Charpentier*


BACKGROUND: Technologies for mak- 
ing and manipulating DNA have enabled 
advances in biology ever since the discov- 
ery of the DNA double helix. But intro- 
ducing site-specific modifications in the 
genomes of cells and organisms remained 
elusive. Early approaches relied on the 
principle of site-specific recognition of 
DNA sequences by oligonucleotides, small 
molecules, or self-splicing introns. More 
recently, the site-directed zinc finger nu- 
cleases (ZFNs) and TAL effector nucleases 
(TALENs) using the principles of DNA- 
protein recognition were developed. How- 
ever, difficulties of protein design, synthe- 
sis, and validation remained a barrier to 


widespread adoption of these engineered 
nucleases for routine use. 

ADVANCES: The field of biology is now ex- 
periencing a transformative phase with the 
advent of facile genome engineering in ani- 
mals and plants using RNA-programmable 
CRISPR-Cas9. The CRISPR-Cas9 technology 
originates from type II CRISPR-Cas systems, 
which provide bacteria with adaptive immu- 
nity to viruses and plasmids. The CRISPR- 
associated protein Cas9 is an endonuclease 
that uses a guide sequence within an RNA 
duplex, tracrRNA:crRNA, to form base pairs
with DNA target sequences, enabling Cas9 to 
introduce a site-specific double-strand break 


[Figure labels: matching DNA target sequence; sgRNA binding to Cas9; PAM sequence.
CRISPR-Cas9 development: DNA deletion, DNA insertion, DNA replacement, DNA modification, DNA labeling, transcription modulation, RNA targeting.
CRISPR-Cas9 applications: biological research, research and development, human medicine, biotechnology, agriculture.]

The Cas9 enzyme (blue) generates breaks in double-stranded DNA by using its two
catalytic centers (blades) to cleave each strand of a DNA target site (gold) next to a
PAM sequence (red) and matching the 20-nucleotide sequence (orange) of the single
guide RNA (sgRNA). The sgRNA includes a dual-RNA sequence derived from CRISPR RNA
(light green) and a separate transcript (tracrRNA, dark green) that binds and stabilizes the
Cas9 protein. Cas9-sgRNA-mediated DNA cleavage produces a blunt double-stranded break
that triggers repair enzymes to disrupt or replace DNA sequences at or near the cleavage
site. Catalytically inactive forms of Cas9 can also be used for programmable regulation of
transcription and visualization of genomic loci.



in the DNA. The dual tracrRNA:crRNA was
engineered as a single guide RNA (sgRNA) 
that retains two critical features: a sequence 
at the 5' side that determines the DNA target
site by Watson-Crick base-pairing and
a duplex RNA structure at the 3' side that 
binds to Cas9. This finding created a simple 
two-component system in which changes in 
the guide sequence of the sgRNA program 
Cas9 to target any DNA sequence of interest. 
The simplicity of CRISPR-Cas9 program- 
ming, together with a unique DNA cleaving 
mechanism, the capacity for multiplexed tar- 
get recognition, and the existence of many 
natural type II CRISPR-Cas system variants, 
has enabled remarkable developments using 
this cost-effective and easy-to-use technol- 
ogy to precisely and efficiently target, edit, 
modify, regulate, and mark genomic loci of a 
wide array of cells and organisms. 
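
The targeting rule described above (a guide sequence that base-pairs with a 20-nucleotide DNA target lying next to a PAM) can be sketched in a few lines of Python. This is only an illustrative sketch, not a published design tool: the function names are hypothetical, the `NGG` PAM pattern and the cut position 3 bp upstream of the PAM are assumptions taken from what is commonly reported for S. pyogenes Cas9 rather than from this summary, and only the given strand is scanned.

```python
import re


def find_cas9_sites(dna, guide_len=20, pam_regex="[ACGT]GG"):
    """Scan one DNA strand for candidate Cas9 target sites.

    Returns a list of (start, protospacer, pam, cut_index) tuples, where
    cut_index is the assumed position of the blunt break, 3 bp upstream
    of the PAM. Only the given strand is scanned (simplification).
    """
    dna = dna.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{%d})(%s))" % (guide_len, pam_regex))
    hits = []
    for match in pattern.finditer(dna):
        start = match.start()
        protospacer, pam = match.group(1), match.group(2)
        cut_index = start + guide_len - 3  # assumption: cut 3 bp 5' of the PAM
        hits.append((start, protospacer, pam, cut_index))
    return hits


def guide_rna(protospacer):
    """Guide portion of an sgRNA: same sequence as the protospacer, with U for T."""
    return protospacer.upper().replace("T", "U")


if __name__ == "__main__":
    example = "A" * 20 + "TGG" + "CCCC"  # made-up sequence for illustration
    for start, protospacer, pam, cut in find_cas9_sites(example):
        print(start, protospacer, pam, cut, guide_rna(protospacer))
```

Changing the guide sequence is the only step needed to retarget the enzyme in this picture, which is the simplicity the summary emphasizes; a practical tool would also scan the reverse strand and score off-target matches.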


ON OUR WEB SITE

Read the full article
at http://dx.doi.org/10.1126/science.1258096


OUTLOOK: CRISPR-Cas9 has triggered a 
revolution in which laboratories around 
the world are using the technology for innovative
applications in biology. This Review
illustrates the power of the technology
to systematically analyze gene functions 
in mammalian cells, study genomic rear- 
rangements and the progression of cancers 
or other diseases, and potentially correct 
genetic mutations responsible for inherited 
disorders. CRISPR-Cas9 is having a major 

impact on functional 

genomics conducted 
in experimental sys- 
tems. Its application in 
genome-wide studies 
will enable large-scale 
screening for drug tar- 
gets and other phenotypes and will facili- 
tate the generation of engineered animal 
models that will benefit pharmacological 
studies and the understanding of human 
diseases. CRISPR-Cas9 applications in plants 
and fungi also promise to change the pace 
and course of agricultural research. Future 
research directions to improve the technol- 
ogy will include engineering or identifying 
smaller Cas9 variants with distinct specific- 
ity that may be more amenable to delivery 
in human cells. Understanding the homol- 
ogy-directed repair mechanisms that follow 
Cas9-mediated DNA cleavage will enhance
insertion of new or corrected sequences into 
genomes. The development of specific meth- 
ods for efficient and safe delivery of Cas9 
and its guide RNAs to cells and tissues will 
also be critical for applications of the tech- 
nology in human gene therapy. ■ 
GENOME EDITING

The new frontier of genome 
engineering with CRISPR-Cas9 

Jennifer A. Doudna* and Emmanuelle Charpentier*

The advent of facile genome engineering using the bacterial RNA-guided CRISPR-Cas9 
system in animals and plants is transforming biology. We review the history of CRISPR 
(clustered regularly interspaced palindromic repeat) biology from its initial discovery 
through the elucidation of the CRISPR-Cas9 enzyme mechanism, which has set the stage 
for remarkable developments using this technology to modify, regulate, or mark genomic 
loci in a wide variety of cells and organisms from all three domains of life. These results 
highlight a new era in which genomic manipulation is no longer a bottleneck to 
experiments, paving the way toward fundamental discoveries in biology, with applications 
in all branches of biotechnology, as well as strategies for human therapeutics. 


Technologies for making and manipulating
DNA have enabled many of the advances 
in biology over the past 60 years. This era 
began with the discovery of the DNA double 
helix and continued with the development 
of chemical methods for solid-phase DNA synthesis,
enabling detection and exploration of
genome organization. Enzymes (including poly- 
merases, ligases, and restriction endonucleases) 
and the polymerase chain reaction (PCR) pro- 
vided ways to isolate genes and gene fragments, 
as well as to introduce mutations into genes in
vitro, in cells, and in model organisms. The ad- 
vent of genomic sequencing technologies and the 
rapid generation of whole-genome sequencing 
data for large numbers and types of organisms, 
including humans, has been one of the singular 
advances of the past two decades. Now, the RNA- 
guided enzyme Cas9, which originates from the 
CRISPR-Cas adaptive bacterial immune sys- 
tem, is transforming biology by providing a 
genome engineering tool based on the princi- 
ples of Watson-Crick base pairing. Ease of use 
and efficiency have led to rapid adoption by lab- 
oratories around the world. Below we discuss 
the history and biology of CRISPR systems, de- 
scribe the molecular mechanisms underlying 
genome editing by Cas9, and review the rapid 
advances in applications of this technology 
since its initial publication in 2012. 
Genome engineering—

A decades-long goal 

Ever since the discovery of the DNA double 
helix, researchers and clinicians have been con- 
templating the possibility of making site-specific 
changes to the genomes of cells and organisms. 
Many of the earliest approaches to what has 
been referred to as genome editing relied on the 
principle of site-specific recognition of DNA se- 
quences (Fig. 1). The study of natural DNA re- 
pair pathways in bacteria and yeast, as well as 
the mechanisms of DNA recombination (1-5), 
revealed that cells have endogenous machin- 
ery to repair double-strand DNA breaks (DSBs) 
that would otherwise be lethal (6-9). Thus, 
methods for introducing precise breaks in the
DNA at sites where changes are to be introduced
were recognized as a valuable strategy for targeted
genomic engineering.

Early approaches to such targeted DNA cleav- 
age took advantage of DNA base pair recognition 
by oligonucleotides or small molecules. Building 
on the original description of triple helix for- 
mation by Rich and colleagues in the late 1950s 
(10, 11), oligonucleotides coupled to chemical 
cleavage or cross-linking reagents such as bleo- 
mycin and psoralen were shown to be useful for 
site-specific chromosome modification in yeast 
and mammalian cells (12-17). Other methods for 
chemical recognition of DNA sequences, such as 
peptide nucleic acids (PNAs) and polyamides,
were shown to enable targeted binding of 
chromosomal loci that could be modified if the 
chemical recognition agent was coupled to a 
cleavage reagent such as bleomycin (18-20). 
Another strategy that relied on nucleic acid 
base pairing was the use of self-splicing introns 
to change sequences at the DNA (21, 22) or RNA 
(23) level. Although these approaches did not
lead to robust methods, they demonstrated the 
utility of base pairing for site-specific genome 
modification. 


The use of self-splicing introns for genome 
editing also suggested the possibility of using 
intron-encoded nucleases— homing endonucleases— 
that are capable of site-specific DNA cleavage and 
integration of the intron sequence. By inserting 
desired sequences into the intron first, researchers 
could incorporate selected genetic information into 
a genome at sites recognized by the homing en- 
donuclease (24, 25). At around the same time, the 
initial reports of zinc finger-mediated DNA bind- 
ing (26, 27) led to the creation of modular DNA 
recognition proteins that, when coupled to the 
sequence-independent nuclease domain of the re- 
striction enzyme FokI, could function as site- 
specific nucleases (28). When designed to recog- 
nize a chromosomal sequence, such zinc finger 
nucleases (ZFNs) were found to be effective at inducing
genomic sequence changes in Drosophila
and mammalian cells (29, 30). Although ZFNs
are effective genome editing reagents for some 
experiments, they were not widely adopted be- 
cause of the difficulty inherent in designing and 
validating such proteins for a specific DNA locus 
of interest. Thus, the field was primed for the 
first reports of transcription activator-like (TAL) 
effectors, which occur naturally in bacteria that 
infect plants, enabling rapid creation of FokI-coupled
versions that could be used similarly to
ZFNs for site-directed genome editing (31-33). 
Such TAL effector nucleases (TALENs) were easier 
than ZFNs to produce and validate, generating 
widespread excitement about the possibility of 
facile genome editing that would be fast and 
inexpensive. But difficulties of protein design, 
synthesis, and validation remained a barrier to 
widespread adoption of these engineered nu- 
cleases for routine use. 

History and biology of 
CRISPR-Cas systems 

In a parallel but completely separate area of 
research, a few microbiology and bioinformatics 
laboratories in the mid-2000s began investigat- 
ing CRISPRs (clustered regularly interspaced 
palindromic repeats), which had been described 
in 1987 by Japanese researchers as a series of 
short direct repeats interspaced with short se- 
quences in the genome of Escherichia coli (34) 
(Fig. 1). CRISPRs were later detected in numer- 
ous bacteria and archaea (35), and predictions 
were made about their possible roles in DNA 
repair or gene regulation (36, 37). A key insight 
came in 2005 with the observation that many 
spacer sequences within CRISPRs derive from 
plasmid and viral origins (38-40). Together with 
the finding that CRISPR loci are transcribed (41) 
and the observation that cas (CRISPR-associated) 
genes encode proteins with putative nuclease 
and helicase domains (38, 40, 42, 43), it was 
proposed that CRISPR-Cas is an adaptive de- 
fense system that might use antisense RNAs as 
memory signatures of past invasions (44). In 2007, 
infection experiments of the lactic acid bacterium
Streptococcus thermophilus with lytic phages 
provided the first experimental evidence of CRISPR- 
Cas-mediated adaptive immunity (45). This finding 
led to the idea that natural CRISPR-Cas systems 
existing in cultured bacteria used in the dairy industry
could be harnessed for immunization against
phages, a first successful application of CRISPR-Cas
for biotechnological purposes (46). In 2008,
mature CRISPR RNAs (crRNAs) were shown to
serve as guides in a complex with Cas proteins to
interfere with virus proliferation in E. coli (47).
The same year, the DNA targeting activity of the
CRISPR-Cas system was reported in the pathogen
Staphylococcus epidermidis (48).

Functional CRISPR-Cas loci comprise a CRISPR 
array of identical repeats intercalated with invader 
DNA-targeting spacers that encode the crRNA 
components and an operon of cas genes encod- 
ing the Cas protein components. In natural en- 
vironments, viruses can be matched to their 
bacterial or archaeal hosts by examining CRISPR 
spacers (49, 50). These studies showed that viruses
are constantly evolving to evade CRISPR-mediated
attenuation.

Adaptive immunity occurs in three stages [for 
recent reviews, see (51-53)]: (i) insertion of a short
sequence of the invading DNA as a spacer se- 
quence into the CRISPR array; (ii) transcription of 
precursor crRNA (pre-crRNA) that undergoes 
maturation to generate individual crRNAs, each 


composed of a repeat portion and an invader- 
targeting spacer portion; and (iii) crRNA-directed 
cleavage of foreign nucleic acid by Cas proteins 
at sites complementary to the crRNA spacer se- 
quence. Within this overall theme, three CRISPR- 
Cas system types (I, II, and III) use distinct 
molecular mechanisms to achieve nucleic add 
recognition and cleavage {54, 55). The protospacer 
adjacent motif (PAM), a short sequence motif ad- 
jacent to the crRNA-targeted sequence on the in- 
vading DNA, plays an essential role in the stages 
of adaptation and interference in type I and type 
II systems {39, 56-58). The type I and type III 
systems use a large complex of Cas proteins for 
crRNA-guided targeting {47, 59-63). However, the 
type II system requires only a single protein for 
RNA-guided DNA recognition and cleavage {64, 65) 
—a property that proved to be extremely useful far 
genome engineering applications (see below). 
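
The three stages described above can be pictured with a deliberately simplified sketch. The class name, the fixed spacer length, and the plain substring match are assumptions made purely for illustration; the sketch ignores the PAM, strand orientation, and the type-specific Cas machinery.

```python
class ToyCrisprLocus:
    """Toy model of the three stages of CRISPR-Cas adaptive immunity."""

    def __init__(self, repeat: str, spacer_length: int = 30):
        self.repeat = repeat.upper()
        self.spacer_length = spacer_length  # hypothetical fixed spacer length
        self.spacers: list[str] = []

    def adapt(self, invader_dna: str) -> str:
        """Stage (i): store a short piece of invading DNA as a new spacer."""
        if len(invader_dna) < self.spacer_length:
            raise ValueError("invader DNA is shorter than the spacer length")
        spacer = invader_dna.upper()[: self.spacer_length]
        self.spacers.append(spacer)
        return spacer

    def mature_crrnas(self) -> list[tuple[str, str]]:
        """Stage (ii): one crRNA per spacer, as (spacer part, repeat part)."""
        to_rna = str.maketrans("T", "U")
        return [
            (spacer.translate(to_rna), self.repeat.translate(to_rna))
            for spacer in self.spacers
        ]

    def interferes(self, foreign_dna: str) -> bool:
        """Stage (iii): foreign DNA is targeted if it matches any spacer."""
        foreign_dna = foreign_dna.upper()
        return any(spacer in foreign_dna for spacer in self.spacers)
```

In this toy model, a sequence is targeted only after a matching spacer has been acquired, which mirrors the adaptive character of the system.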

Functionality of CRISPR-Cas9 

Bioinformatic analyses first identified Cas9 (for-
merly COG3513, Csx12, Cas5, or Csn1) as a large
multifunctional protein (36) with two putative nu-
clease domains, HNH (38, 43, 44) and RuvC-like
(44). Genetic studies showed that S. thermophilus
Cas9 is essential for defense against viral inva-
sion (45, 66), might be responsible for introduc-
ing DSBs into invading plasmids and phages (67),
enables in vivo targeting of temperate phages
and plasmids in bacteria (66, 68), and requires
the HNH and RuvC domains to interfere with
plasmid transformation efficiency (68).

Fig. 1. Timeline of CRISPR-Cas and genome engineering research fields. Key developments in both fields are shown. These two fields merged in 2012
with the discovery that Cas9 is an RNA-programmable DNA endonuclease, leading to the explosion of papers beginning in 2013 in which Cas9 has been
used to modify genes in human cells as well as many other cell types and organisms.

In 2011 (66), trans-activating crRNA (tracrRNA),
a small RNA that is trans-encoded upstream of
the type II CRISPR-Cas locus in Streptococcus
pyogenes, was reported to be essential for crRNA
maturation by ribonuclease III and Cas9, and
tracrRNA-mediated activation of crRNA matu-
ration was found to confer sequence-specific im-
munity against parasite genomes. In 2012 (64),
the S. pyogenes CRISPR-Cas9 protein was shown
to be a dual-RNA-guided DNA endonuclease that
uses the tracrRNA:crRNA duplex (66) to direct
DNA cleavage (64) (Fig. 2). Cas9 uses its HNH
domain to cleave the DNA strand that is com-
plementary to the 20-nucleotide sequence of the
crRNA; the RuvC-like domain of Cas9 cleaves the
DNA strand opposite the complementary strand
(64, 65) (Fig. 2). Mutating either the HNH or the
RuvC-like domain in Cas9 generates a variant pro-
tein with single-stranded DNA cleavage (nickase)
activity, whereas mutating both domains (dCas9;
Asp10 → Ala, His840 → Ala) results in an RNA-
guided DNA binding protein (64, 65). DNA target
recognition requires both base pairing to the
crRNA sequence and the presence of a short se-
quence (PAM) adjacent to the targeted sequence
in the DNA (64, 65) (Fig. 2).
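
The division of labor between the two nuclease domains can be summarized in a small sketch. The class, field, and constant names are hypothetical, and the model encodes only what is stated above: HNH cuts the strand complementary to the crRNA, the RuvC-like domain cuts the opposite strand, and recognition requires both guide base pairing and a PAM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cas9Variant:
    """Hypothetical record of which nuclease domains remain active."""
    name: str
    hnh_active: bool = True
    ruvc_active: bool = True

    def cleaved_strands(self) -> list[str]:
        strands = []
        if self.hnh_active:
            strands.append("complementary")     # strand base-paired to the crRNA
        if self.ruvc_active:
            strands.append("noncomplementary")  # opposite strand
        return strands

    def outcome(self, guide_pairs: bool, pam_present: bool) -> str:
        # Recognition needs both guide base pairing and an adjacent PAM.
        if not (guide_pairs and pam_present):
            return "no recognition"
        cuts = len(self.cleaved_strands())
        return ("DNA binding only", "nick", "double-strand break")[cuts]


WILD_TYPE = Cas9Variant("Cas9")
HNH_MUTANT = Cas9Variant("HNH mutant", hnh_active=False)
RUVC_MUTANT = Cas9Variant("RuvC mutant", ruvc_active=False)
DCAS9 = Cas9Variant("dCas9", hnh_active=False, ruvc_active=False)
```

Under these assumptions, either single-domain mutant behaves as a nickase, and the double mutant retains only programmable DNA binding.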

The dual tracrRNA:crRNA was then engineered
as a single guide RNA (sgRNA) that retains two 
critical features: the 20-nucleotide sequence at 
the 5' end of the sgRNA that determines the DNA 
target site by Watson-Crick base pairing, and the 
double-stranded structure at the 3' side of the 
guide sequence that binds to Cas9 (64) (Fig. 2). 
This created a simple two-component system in 
which changes to the guide sequence (20 nucleo- 
tides in the native RNA) of the sgRNA can be 
used to program CRISPR-Cas9 to target any DNA 
sequence of interest as long as it is adjacent to 
a PAM (64). In contrast to ZFNs and TALENs, 
which require substantial protein engineering for 
each DNA target site to be modified, the CRISPR- 
Cas9 system requires only a change in the guide 
RNA sequence. For this reason, the CRISPR-Cas9
technology using the S. pyogenes system has been
rapidly and widely adopted by the scientific com- 
munity to target, edit, or modify the genomes of a 
vast array of cells and organisms. Phylogenetic 
studies (69-71) as well as in vitro and in vivo 
experiments (64, 71, 72) show that naturally 
occurring Cas9 orthologs use distinct tracrRNA: 
crRNA transcripts as guides, defined by the 
specificity to the dual-RNA structures (69-71) (Fig. 
3). The reported collection of Cas9 orthologs con- 
stitutes a large source of CRISPR-Cas9 systems for 
multiplex gene targeting, and several orthologous 
CRISPR-Cas9 systems have already been applied 
successfully for genome editing in human cells 
[Neisseria meningitidis (73, 74), S. thermophilus
(73, 75), and Treponema denticola (73)].
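
The two-component targeting rule (a 20-nucleotide guide plus an adjacent PAM) lends itself to a short sketch that scans a DNA sequence for candidate target sites. The 5'-NGG-3' PAM used here is the motif commonly reported for S. pyogenes Cas9; it is not stated in the text above and is an assumption of this example, as are the function names.

```python
import re

GUIDE_LENGTH = 20           # guide length given in the text
PAM_PATTERN = "[ACGT]GG"    # assumed 5'-NGG-3' PAM (S. pyogenes Cas9)


def reverse_complement(seq: str) -> str:
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_target_sites(dna: str) -> list[tuple[str, int, str, str]]:
    """Return (strand, start, protospacer, PAM) for every candidate site.

    Start positions are relative to the 5' end of the scanned strand.
    A lookahead is used so that overlapping sites are all reported.
    """
    dna = dna.upper()
    site = re.compile(rf"(?=([ACGT]{{{GUIDE_LENGTH}}})({PAM_PATTERN}))")
    hits = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for match in site.finditer(seq):
            hits.append((strand, match.start(), match.group(1), match.group(2)))
    return hits


def guide_rna(protospacer: str) -> str:
    """The guide has the same sequence as the PAM-containing strand, with U for T."""
    return protospacer.upper().replace("T", "U")
```

A real design tool would also score each candidate for potential off-target sites; this sketch only enumerates positions that satisfy the guide-plus-PAM rule.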

Although the CRISPR acronym has attracted 
media attention and is widely used in the scien- 
tific and popular literature, nearly all genome 
editing applications are based on the use of the 
protein Cas9 together with suitable sgRNAs. As 
discussed above, CRISPR refers to the repetitive 
nature of the repeats in the CRISPR arrays that 
encode crRNAs, and the term does not relate
directly to genome engineering. Nonetheless, we
prefer to use “CRISPR-Cas9” in a way that is less 
restrictive than other nomenclatures that have 
been used in the field (76). 

Mechanism of CRISPR-Cas9-mediated 
genome targeting 

Fig. 2. Biology of the type II-A CRISPR-Cas system. The type II-A system from S. pyogenes is shown as an example. (A) The cas gene operon with
tracrRNA and the CRISPR array. (B) The natural pathway of antiviral defense involves association of Cas9 with the antirepeat-repeat RNA (tracrRNA:
crRNA) duplexes, RNA co-processing by ribonuclease III, further trimming, R-loop formation, and target DNA cleavage. (C) Details of the natural DNA
cleavage with the duplex tracrRNA:crRNA.
[Fig. 2 panel labels: (A) Genomic CRISPR locus; adaptation; maturation and interference. (B) tracrRNA:crRNA co-maturation and Cas9 co-complex formation; pre-crRNA (precursor crRNA).]

Structural analysis of S. pyogenes Cas9 has re-
vealed additional insights into the mechanism
of CRISPR-Cas9 (Fig. 3). Molecular structures of
Cas9 determined by electron microscopy and
x-ray crystallography show that the protein un-
dergoes large conformational rearrangement
upon binding to the guide RNA, with a further
change upon association with a target double-
stranded DNA (dsDNA). This change creates a
channel, running between the two structural
lobes of the protein, that binds to the RNA-DNA
hybrid as well as to the coaxially stacked dual-
RNA structure of the guide corresponding to
the crRNA repeat-tracrRNA antirepeat interac-
tion (77, 78). An arginine-rich α helix (77-79)
bridges the two structural lobes of Cas9 and ap-
pears to be the hinge between them, in addition
to playing a central role in binding the guide
RNA-target DNA hybrid as shown by mutagen- 
esis (77, 78). The conformational change in Cas9 
may be part of the mechanism of target dsDNA 
unwinding and guide RNA strand invasion, 
although this idea remains to be tested. Mech- 
anistic studies also show that the PAM is critical 
for initial DNA binding; in the absence of the 
PAM, even target sequences fully complemen-
tary to the guide RNA sequence are not rec-
ognized by Cas9 (80). A crystal structure of Cas9
in complex with a guide RNA and a partially
dsDNA target demonstrates that the PAM lies
within a base-paired DNA structure (81). Arginine
motifs in the C-terminal domain of Cas9 interact 
with the PAM on the noncomplementary strand 
within the major groove. The phosphodiester 
group at position +1 in the target DNA strand 
interacts with the minor groove of the duplexed 
PAM, possibly resulting in local strand separa- 
tion, the so-called R-loop, immediately upstream 
of the PAM (81). Single-molecule experiments also
suggest that R-loop association rates are affected
primarily by the PAM, whereas R-loop stability
is influenced mainly by protospacer elements
distal to the PAM (82). Together with single-
molecule and bulk biochemical experiments
using mutated target DNAs, a mechanism can 
be proposed whereby target DNA melting starts 
at the level of PAM recognition, resulting in di- 
rectional R-loop formation expanding toward 
the distal protospacer end and concomitant RNA 
strand invasion and RNA-DNA hybrid forma- 
tion (80-82). 

To assess the target-binding behavior of Cas9 
in cells, researchers used chromatin immuno- 
precipitation and high-throughput sequencing 
(ChIP-seq) to determine the numbers and types 
of Cas9 binding sites on the chromosome. Re- 
sults showed that in both human embryonic kid- 
ney (HEK293) cells (83) and mouse embryonic 
stem cells (mESCs) (84), a catalytically inactive
version of Cas9 bound to many more sites than 
those matching the sequence of the sgRNA used 
in each case. Such off-target interactions with 
DNA, typically at sites bearing a PAM and par-
tially complementary to the guide RNA se-
quence, are consistent with established modes
of DNA interrogation by Cas9 (80). Active Cas9
rarely cleaves the DNA at off-target binding sites, 
implying decoupled binding and cleavage events
in which nearly perfect complementarity between
the guide RNA and the target site is necessary
for efficient DNA cleavage. These observations are 
consistent with results obtained for Cas9-guide 
RNA complexes in single-molecule experiments 
(80). Furthermore, Cas9 binding events occur more 
densely in areas of open chromatin as compared 
to regions of compact, transcriptionally inactive 
chromatin. However, because the method involves 
cross-linking cells for ~10 min before quenching 
the reaction, transient and long-lived binding in- 
teractions cannot be distinguished. It is possible 
that many of the apparent off-target DNA in- 
teractions in fact reflect brief encounters that 
would not normally trigger strand invasion by 
the guide RNA. 

Engineering cells and model organisms 

Following the 2012 publication of Jinek et al. 
(64), three studies in January 2013 demonstrated 
that CRISPR-Cas9 represents an efficient tool to 
edit the genomes of human cells (75, 85, 86). 
The “humanized” versions of S. pyogenes Cas9 
(75, 85, 86) and S. thermophilus Cas9 (75) were 
coexpressed with custom-designed sgRNAs 
(75, 85, 86) or with tracrRNA coexpressed with 
custom-designed crRNAs (75) in human embry- 
onic kidney, chronic myelogenous leukemia, or 
induced pluripotent stem cells (75, 85, 86) as well 
as in mouse cells (75). The expected alterations 
in the target DNA were observed, indicating 
that site-specific DSBs by RNA-guided Cas9 had 
stimulated gene editing by nonhomologous end 
joining repair or gene replacement by homology- 
directed repair (Fig. 4). Targeting with multiple 
sgRNAs— referred to as multiplexing— was also 
successfully achieved (75, 86). RNA-programmable 
S. pyogenes Cas9-mediated editing has now been 
applied to various human cells and embryonic 
stem cells [(87-90); for reviews, see (91-93)]. Al-
though direct comparisons can be difficult to
assess because of differences in target sites and 
protein expression levels, some analyses show 
that CRISPR-Cas9-mediated editing efficiencies 
can reach 80% or more depending on the target,
which is as high as or higher than levels observed
using ZFNs or TALENs (89, 94).

Fig. 3. Evolution and structure of Cas9. The structure of S. pyogenes Cas9 in the unliganded and RNA-DNA-bound
forms [from (77, 81)].

These initial studies were only the beginning 
of what has become an incredibly fast-paced field 
in which laboratories around the world have used 
CRISPR-Cas9 to edit genomes of a wide range of 
cell types and organisms (summarized in Fig. 5). As 
of this writing, more than 1000 papers have been 
published that include the CRISPR acronym in 
the title or abstract, with the majority of these 
published since the beginning of 2013. Many of 
these applications have been discussed in re- 
cent reviews (91-93). Here we highlight a few 
examples that illustrate the power of the tech- 
nology (Fig. 6). The first example is the precise 
reproduction of tumor-associated chromosomal 
translocations, which come about during carcino- 
genesis through illegitimate nonhomologous 
joining of two chromosomes. The ability of CRISPR- 
Cas9 to introduce DSBs at defined positions has 
made it possible to generate human cell lines and 
primary cells bearing chromosomal translocations
resembling those described in cancers such as 
lung cancer (95), acute myeloid leukemia, and 
Ewing’s sarcoma (96, 97). An improved method 
to generate liver cancer or myeloid malignancy 
models in mice facilitated by CRISPR-Cas9 was 
recently reported (98, 99). CRISPR-Cas9 thus pro-
vides a robust technology for studying genomic
rearrangements and the development and pro- 
gression of cancers or other diseases. 

A second example is the systematic analysis of 
gene functions in mammalian cells. A genome- 
scale lentiviral sgRNA library was developed 
to generate a pooled loss-of-function genetic 
screening approach suitable for both positive 
and negative selection (100, 101). This approach 
was also used to identify genes essential for 
cell viability in cancer and pluripotent stem cells 
(102). Although such studies have been attempted 
using RNA interference (RNAi) to reduce the 
expression of genes, this strategy does not allow 
the generation of gene knockouts and can suf- 
fer from substantial off-target effects. The use 
of CRISPR-Cas9 for genome-wide studies will 
enable large-scale screening for drug targets and 
other phenotypes and thus will expand the nature
and utility of genetic screens in human and other
nonmodel cell types and organisms. 

Other pertinent examples of 
CRISPR-Cas9 applications with rel- 
evance to human health include 
the ability to correct genetic muta- 
tions responsible for inherited dis- 
orders. A dominant mutation in 
the Crygc gene responsible for 
cataracts was successfully corrected 
in mice (103). Using cultured pri- 
mary adult intestinal stem cells 
derived from cystic fibrosis patients, 
the CFTR locus responsible for 
cystic fibrosis was corrected by 
homologous recombination, result-
ing in the clonal expansion of
miniature organlike cell cultures
(organoids) harboring the desired,
exact genetic change (104). These
studies underscore the potential for this technol-
ogy to be used for human gene therapy to treat
genetic disorders. 

A last example of CRISPR-Cas9 as a genome 
engineering technology is its application to plants 
and fungi. Since its demonstration as a genome 
editing tool in Arabidopsis thaliana and Nicoti-
ana benthamiana (105, 106), editing has been
demonstrated in crop plants including rice, wheat,
and sorghum as well as sweet orange and liverwort
(107-111). This technology promises to change
the pace and course of agricultural research. For 
example, a recent study in rice found that target 
genes were edited in nearly 50% of the embryogenic 
cells that received the Cas9-guide RNA constructs, 
and editing occurred before the first cell division 
(112). Furthermore, these genetic changes were 
passed to the next generation of plants without 
new mutation or reversion, and whole-genome 
sequencing did not reveal substantial off-target
editing. Such findings suggest that modification
of plant genomes to provide protection from 
disease and resistance to pests may be much easier 
than has been the case with other technologies. 
The regulatory implications of CRISPR-Cas9 tech- 
nology for use in plants are not yet clear and will 
certainly depend on the type of mutation(s) to be 
introduced. 

In general, the lack of efficient, inexpensive, 
fast-to-design, and easy-to-use precision genetic
tools has also been a limiting factor for the anal-
ysis of gene functions in model organisms of
developmental and regenerative biology. Effi-
cient genome engineering to allow targeted
genome modifications in the germ lines of ani-
mal models such as fruit flies (113, 114), zebra-
fish (94, 115), nematodes (116), salamanders (117),
and frogs (118, 119) is now possible with the de-
velopment of the CRISPR-Cas9 technology. The
technology can also facilitate the generation of
mouse (120-122) and rat (123, 124) models better
suited to pharmacological studies and the un-
derstanding of human diseases, as well as pigs
(125) and monkeys (126). Overall, CRISPR-Cas9 is
already having a major impact on functional 
genomic experiments that can be conducted in 
these model systems, which will advance the 
field of experimental biology in ways not imag- 
ined even a few years ago. 

Further development of the technology 

A key property of Cas9 is its ability to bind to 
DNA at sites defined by the guide RNA sequence 
and the PAM, allowing applications beyond 
permanent modification of DNA. In particular, 
a catalytically deactivated version of Cas9 (dCas9)
has been repurposed for targeted gene regula- 
tion on a genome-wide scale. Referred to as 
CRISPR interference (CRISPRi), this strategy 
was shown to block transcriptional elongation, 
RNA polymerase binding, or transcription factor 
binding, depending on the site(s) recognized by 
the dCas9-guide RNA complex. Demonstrated 
first in E. coli, whole-genome sequencing showed 
that there were no detectable off-target effects 
(127). CRISPRi has been used to repress multiple
target genes simultaneously, and its effects are
reversible (127-130).

By generating chimeric versions of dCas9 that 
are fused to regulatory domains, it has been pos- 
sible to use CRISPRi for efficient gene regulation 
in mammalian cells. Specifically, fusion of dCas9 
to effector domains including VP64 or KRAB 
allowed stable and efficient transcriptional acti- 
vation or repression, respectively, in human and 
yeast cells (129). As observed in bacteria, site(s) of
regulation were defined solely by the coexpressed 
guide RNA(s) for dCas9. RNA-seq analysis showed 
that CRISPRi-directed transcriptional repression 
is highly specific. More broadly, these results 
demonstrated that dCas9 can be used as a 
modular and flexible DNA-binding platform for
the recruitment of proteins to a target DNA
sequence in a genome, laying the foundation 
for future experiments involving genome-wide 
screening similar to those performed using 
RNAi. The lack of CRISPR-Cas systems in eu- 
karyotes is an important advantage of CRISPRi 
over RNAi for various applications in which 
competition with the endogenous pathways is 
problematic. For example, using RNAi to si- 
lence genes that are part of the RNAi pathway 
itself (i.e., Dicer, Argonaute) can lead to results
that are difficult to interpret due to multiple
direct and indirect effects. In addition, any RNAs 
used to silence specific genes may compete with 
endogenous RNA-mediated gene regulation in 
cells. With its ability to permanently change the 
genetic code and to up- or down-regulate gene 
expression at the transcriptional or posttranscrip- 
tional level, CRISPR-Cas9 offers a large versatility 
in harnessing alternatives, whereas RNAi is mostly 
restricted to knocking down gene expression. 
Although RNAi has been improving over the 
years, incomplete knockdowns or unpredictable 
off-targeting are still reported bottlenecks of this 
technology, and future comparative analyses 
should address the superiority of CRISPRi over 
RNAi in these aspects. 

The programmable binding capability of dCas9 
can also be used for imaging of specific loci in live
cells. An enhanced green fluorescent protein- 
tagged dCas9 protein and a structurally optimized 
sgRNA were shown to produce robust imaging of 
repetitive and nonrepetitive elements in telomeres 
and coding genes in living cells (131). This CRISPR
imaging tool has the potential to improve the cur- 
rent technologies for studying conformational 
dynamics of native chromosomes in living cells, 
particularly if multicolor imaging can be developed 
using multiple distinct Cas9 proteins. It may also 
be possible to couple fluorescent proteins or small 
molecules to the guide RNA, providing an orthog-
onal strategy for multicolor imaging using Cas9.

Novel technologies aiming to disrupt provi-
ruses may be an attractive approach to eliminat-
ing viral genomes from infected individuals and
thus curing viral infections. An appeal of this
strategy is that it takes advantage of the primary
native functions of CRISPR-Cas systems as anti-
viral adaptive immune systems in bacteria. The
targeted CRISPR-Cas9 technique was shown to
efficiently cleave and mutate the long terminal
repeat sites of HIV-1 and also to remove internal
viral genes from the chromosome of infected cells
(132, 133).

Fig. 5. Examples of cell types and organisms that have been engineered using Cas9.

CRISPR-Cas9 is also a promising technology in
the field of engineering and synthetic biology. A
multiplex CRISPR approach referred to as CRISPRm
was developed to facilitate directed evolution of
biomolecules (134). CRISPRm consists of the op-
timization of CRISPR-Cas9 to generate quantita-
tive gene assembly and DNA library insertion
into the fungal genomes, providing a strategy to
improve the activity of biomolecules. In addition,
it has been possible to induce Cas9 to bind single-
stranded RNA in a programmable fashion by using
short DNA oligonucleotides containing PAM se-
quences (PAMmers) to activate the enzyme, sug-
gesting new ways to target transcripts without
prior affinity tagging (135).

A series of studies have reported the efficiency
with which the RNA-programmable S. pyogenes
Cas9 targets and cleaves DNA and have also
addressed the level of its specificity by moni-
toring the ratio of off-site targeting (136-140).
Off-site targeting is defined by the tolerance of
Cas9 to mismatches in the RNA guide sequence
and is dependent on the number, position, and
distribution of mismatches throughout the en-
tire guide sequence (136-140) beyond the initial
seed sequence originally defined as the first 8
to 12 nucleotides of the guide sequence prox-
imal to the PAM (64) (Fig. 2). The amount of
Cas9 enzyme expressed in the cell is an im-
portant factor in tolerance to mismatches (138).
High concentrations of the enzyme were re-
ported to increase off-site targeting, whereas
lowering the concentration of Cas9 increases
specificity while diminishing on-target cleavage
activity (137). Several groups have developed
algorithmic tools that predict the sequence of
an optimal sgRNA with minimized off-target effects
(for example, http://tools.genome-engineering.org,
http://zifit.partners.org, and www.e-crisp.org)
(141-145). The development of alternative genome-
wide approaches that would also consider other
features of the reaction, such as the thermody-
namic properties of the sgRNA, may also in-
crease the specificity of the design.
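
The dependence of off-site targeting on the number and position of mismatches can be illustrated with a minimal sketch that compares a guide with a candidate site and separates seed from PAM-distal mismatches. The function name and the default seed length of 12 (the upper end of the 8-to-12-nucleotide range mentioned above) are assumptions; sequences are written 5' to 3' in DNA letters, with the PAM-proximal end at the 3' end. This is not the scoring scheme of any of the tools cited.

```python
def mismatch_profile(guide: str, site: str, seed_length: int = 12) -> dict:
    """Count mismatches between a guide and an equally long candidate site.

    Positions are 0-based from the 5' (PAM-distal) end, so the seed is
    taken to be the last `seed_length` positions.
    """
    if len(guide) != len(site):
        raise ValueError("guide and site must have the same length")
    if not 0 <= seed_length <= len(guide):
        raise ValueError("seed_length must fit within the guide")

    positions = [
        i for i, (g, s) in enumerate(zip(guide.upper(), site.upper())) if g != s
    ]
    seed_start = len(guide) - seed_length
    return {
        "total": len(positions),
        "seed": sum(1 for p in positions if p >= seed_start),
        "distal": sum(1 for p in positions if p < seed_start),
        "positions": positions,
    }
```

A simple filter might, for example, discard candidate guides that have a near-perfect seed match elsewhere in the genome, although any such threshold would need to be chosen empirically.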

Several studies of the CRISPR-Cas9 technol- 
ogy relate to the specificity of DNA targeting 
(Fig. 4): a double-nicking approach consisting 
of using the nickase variant of Cas9 with a pair 
of offset sgRNAs properly positioned on the 
target DNA (146-148); an sgRNA-guided dCas9
fused to the FokI nuclease where
two fused dCas9-FokI monomers
can simultaneously bind target sites
at a defined distance apart (149, 150);
and shorter sgRNAs truncated by
two or three nucleotides at the
distal end relative to the PAM that
can be used with the double nick-
ing strategy to further reduce off-
target activity (151). The first two
methods rely on Cas9 dimerization
similar to the engineered dimeric
ZFNs and TALENs, with the princi-
ple that two adjacent off-target bind-
ing events and subsequent cleavage
are less likely to occur than a single off-target
cleavage (148-150). The latter method follows the
reasoning according to which the 5'-end nucleo-
tides of the sgRNAs are not necessary for their
full activity; however, they may compensate for
mismatches at other positions along the guide
RNA-target DNA interface, and thus shorter
sgRNAs may be more specific (151). Future ef-
forts will focus on further developing the pre-
cision of the technology, as well as increasing
the frequency of homology-directed repair rel- 
ative to nonhomologous end joining in order 
to favor site-specific insertion of new genetic 
information. 
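
The truncated-guide strategy described above amounts to trimming two or three nucleotides from the PAM-distal (5') end of the guide sequence. The following sketch shows that operation only; the function name is hypothetical, and the guide is assumed to be written 5' to 3'.

```python
def truncate_guide(guide: str, n_removed: int = 2) -> str:
    """Remove 2 or 3 nucleotides from the 5' (PAM-distal) end of a guide."""
    if n_removed not in (2, 3):
        raise ValueError("the strategy described removes two or three nucleotides")
    if len(guide) <= n_removed:
        raise ValueError("guide is too short to truncate")
    # The 3' (PAM-proximal) portion, including the seed, is left untouched.
    return guide[n_removed:]
```

For a 20-nucleotide guide, this yields an 18- or 17-nucleotide guide that still contains the PAM-proximal seed region.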

Conclusions and perspectives 

Our understanding of how genomes direct de- 
velopment, normal physiology, and disease in 
higher organisms has been hindered by a lack of
suitable tools for precise and efficient gene en- 
gineering. The simple two-component CRISPR- 
Cas9 system, using Watson-Crick base pairing by 
a guide RNA to identify target DNA sequences, is 
a versatile technology that has already stimulated 
innovative applications in biology. Understanding 
the CRISPR-Cas9 system at the biochemical 
and structural level allows the engineering of 
tailored Cas9 variants with smaller size and 
increased specificity. A crystal structure of the 
smaller Cas9 protein from Actinomyces, for exam- 
ple, showed how natural variation created a 
streamlined enzyme, setting the stage for future
engineered Cas9 variants (77). A deeper analysis 
of the large panel of naturally evolving bacterial 
Cas9 enzymes may also reveal orthologs with 
distinct DNA binding specificity, will broaden 
the choice of PAMs, and will certainly reveal 
shorter variants more amenable for delivery in 
human cells. 

Furthermore, specific methods for delivering 
Cas9 and its guide RNA to cells and tissues 
should benefit the field of human gene therapy. 
For example, recent experiments confirmed that 
the Cas9 protein-RNA complex can be intro- 
duced directly into cells using nucleofection or 
cell-penetrating peptides to enable rapid and 
timed editing (89, 152), and transgenic organisms
that express Cas9 from inducible promoters are
being tested. An exciting harbinger of future 
research in this area is the recent demonstration 
that Cas9-guide RNA complexes, when injected 
into adult mice, provided sufficient editing in the 
liver to alleviate a genetic disorder (153). Under- 
standing the rates of homology-directed repair 
after Cas9-mediated DNA cutting will advance the 
field by enabling efficient insertion of new or 
corrected sequences into cells and organisms. In 
addition, the rapid advance of the field has raised 
excitement about commercial applications of 
CRISPR-Cas9. 

The era of straightforward genome editing 
raises ethical questions that will need to be 
addressed by scientists and society at large. How 
can we use this powerful tool in such a way as to 
ensure maximum benefit while minimizing risks? 
It will be imperative that nonscientists understand
the basics of this technology sufficiently well to
facilitate rational public discourse. Regulatory
agencies will also need to consider how best to
foster responsible use of CRISPR-Cas9 technolo- 
gy without inhibiting appropriate research and 
development. 

The identification of the CRISPR-Cas9 tech-
nology underscores the way in which many
inventions that have advanced molecular biology
and medicine emanated through basic research
on natural mechanisms of DNA replication, re-
pair, and defense against viruses. In many cases,
key methodologies emerged from the study of
bacteria. The CRISPR-Cas9 technology originated
through a similar process: Once the mechanism 
underlying how the CRISPR-Cas9 system works 
was understood, it could be harnessed for applica- 
tions in molecular biology and genetics that were 
not previously envisioned.

INTRODUCTION: The kingdom Fungi
is one of the most diverse groups of 
organisms on Earth, and they are integral 
ecosystem agents that govern soil carbon 
cycling, plant nutrition, and pathology. 
Fungi are widely distributed in all terres- 
trial ecosystems, but the distribution of 
species, phyla, and functional groups has 
been poorly documented. On the basis 
of 365 global soil samples from nat- 
ural ecosystems, we determined 
the main drivers and biogeo- 
graphic patterns of fungal 
diversity and commu-
nity composition.


RATIONALE: We identified soil-inhabiting 
fungi using 454 Life Sciences (Branford, 
CN) pyrosequencing and through compari- 
son against taxonomically and functionally 
annotated sequence databases. Multiple re- 
gression models were used to disentangle 
the roles of climatic, spatial, edaphic, and 
floristic parameters on fungal diversity and 
community composition. Structural equa- 
tion models were used to determine the 
direct and indirect effects of climate 
on fungal diversity, soil chemistry, 
and vegetation. We also exam- 
ined whether fungal biogeo- 
graphic patterns matched 
paradigms derived 
from plants and
animals: namely, that species’ latitudinal
ranges increase toward the poles (Rapo-
port’s rule) and diversity increases toward
the equator. Last, we sought group-specific
global biogeographic links among major bio-
geographic regions and biomes using a net-
work approach and area-based clustering.

Direct and indirect effects of climatic and edaphic variables on plant and fungal richness.
Line thickness corresponds to the relative strength of the relationships between the variables
that affect species richness. Dashed lines indicate negative relationships. MAP, mean annual
precipitation; Fire, time since last fire; Dist. equator, distance from the equator; Ca, soil calcium
concentration; P, soil phosphorus concentration; pH, soil pH.


RESULTS: Metabarcoding analysis of
global soils revealed fungal richness esti- 
mates approaching the number of species 
recorded to date. Distance from equator 
and mean annual precipitation had the 
strongest effects on rich- 
ness of fungi, including 
most fungal taxonomic 
and functional groups. 
Diversity of most fun- 
gal groups peaked in 
tropical ecosystems, but 
ectomycorrhizal fungi and several fungal 
classes were most diverse in temperate or 
boreal ecosystems, and many fungal groups 
exhibited distinct preferences for specific 
edaphic conditions (such as pH, calcium, 
or phosphorus). Consistent with Rapoport’s 
rule, the geographic range of fungal taxa in- 
creased toward the poles. Fungal endemicity 
was particularly strong in tropical regions, 
but multiple fungal taxa had cosmopolitan 
distribution. 


CONCLUSIONS: Climatic factors, followed 
by edaphic and spatial patterning, are the 
best predictors of soil fungal richness and 
community composition at the global scale. 
Richness of all fungi and functional groups 
is causally unrelated to plant diversity, with 
the exception of ectomycorrhizal root symbi- 
onts, suggesting that plant-soil feedbacks do 
not influence the diversity of soil fungi at the 
global scale. The plant-to-fungi richness ra- 
tio declined exponentially toward the poles, 
indicating that current predictions— assum- 
ing globally constant ratios— overestimate 
fungal richness by 1.5- to 2.5-fold. Fungi fol- 
low similar biogeographic patterns as plants 
and animals, with the exception of several 
major taxonomic and functional groups that 
run counter to overall patterns. Strong 
biogeographic links among distant 
continents reflect relatively efficient 
long-distance dispersal compared with 
macro-organisms. ■

of soil fungi

Leho Tedersoo, Mohammad Bahram, Sergei Põlme, Urmas Kõljalg,
Nourou S. Yorou, Ravi Wijesundera, Luis Villarreal Ruiz, Aida M. Vasco-Palacios,
Pham Quang Thu, Ave Suija, Matthew E. Smith, Cathy Sharp, Erki Saluveer,
Alessandro Saitta, Miguel Rosas, Taavi Riit, David Ratkowsky, Karin Pritsch,
Kadri Põldmaa, Meike Piepenbring, Cherdchai Phosri, Marko Peterson,
Kaarin Parts, Kadri Pärtel, Eveli Otsing, Eduardo Nouhra, Andre L. Njouonkou,
R. Henrik Nilsson, Luis N. Morgado, Jordan Mayor, Tom W. May,
Luiza Majuakim, D. Jean Lodge, Su See Lee, Karl-Henrik Larsson, Petr Kohout,
Kentaro Hosaka, Indrek Hiiesalu, Terry W. Henkel, Helery Harend,
Liang-dong Guo, Alina Greslebin, Gwen Grelet, Jozsef Geml, Genevieve Gates,
William Dunstan, Chris Dunk, Rein Drenkhan, John Dearnaley, Andre De Kesel,
Tan Dang, Xin Chen, Franz Buegger, Francis Q. Brearley, Gregory Bonito,
Sten Anslan, Sandra Abell, Kessy Abarenkov

Fungi play major roles in ecosystem processes, but the determinants of fungal diversity and 
biogeographic patterns remain poorly understood. Using DNA metabarcoding data from 
hundreds of globally distributed soil samples, we demonstrate that fungal richness is decoupled 
from plant diversity. The plant-to-fungus richness ratio declines exponentially toward the poles. 
Climatic factors, followed by edaphic and spatial variables, constitute the best predictors of 
fungal richness and community composition at the global scale. Fungi show similar latitudinal 
diversity gradients to other organisms, with several notable exceptions. These findings advance 
our understanding of global fungal diversity patterns and permit integration of fungi into a 
general macroecological framework. 


Fungi are eukaryotic microorganisms that
play fundamental ecological roles as decomposers,
mutualists, or pathogens of plants
and animals; they drive carbon cycling in
forest soils, mediate mineral nutrition of
plants, and alleviate carbon limitations of other
soil organisms. Fungi comprise some 100,000 described
species (accounting for synonyms), but
the actual extent of global fungal diversity is estimated
at 0.8 million to 5.1 million species (1).

Globally, the biomass and relative proportions 
of microbial groups, including fungi, co-vary with 
the concentration of growth-limiting nutrients in 
soils and plant tissues. Such patterns suggest that 
the distribution of microbes reflects latitudinal 
variation in ecosystem nutrient dynamics (2-4). 
Richness of nearly all terrestrial and marine macroorganisms
is negatively related to increasing
latitude (5), a pattern attributed to the combined
effects of climate, niche conservatism, and rates
of evolutionary radiation and extinction (6). Although
morphological species of unicellular microbes
are usually cosmopolitan (7), there is growing
evidence that the distribution of microorganisms
is shaped by macroecological and community assembly
processes (8). Only a few of these biogeographic
processes have been demonstrated
for fungi at the local scale (9). Despite their enormous
diversity and importance in ecosystem function,
little is known about general patterns of
fungal diversity or functional roles over large


geographic scales. We used a global data set to 
disentangle the roles of climatic, edaphic, floris- 
tic, and spatial variables governing global-scale 
patterns of soil fungal diversity. We also address 
macroecological phenomena and show that fungi 
largely exhibit strong biogeographic patterns that 
appear to be driven by dispersal limitation and 
climate. 

Materials and methods 

We collected 40 soil cores from natural communities
in each of 365 sites across the world using
a uniform sampling protocol (Fig. 1A and data
file S1). Most plots (2500 m²) were circular, but
in steep mountain regions and densely forested 
areas, some plots were oblong. We randomly se- 
lected 20 trees located at least 8 m apart. In two 
opposite directions, 1 to 1.5 m from each tree 
trunk, loose debris was removed from the forest 
floor. Polyvinyl chloride (PVC) tubes (5 cm in di- 
ameter) were hammered into the soil down to 
5 cm depth. These soil cores almost always in- 
cluded fine roots and comprised both the organic 
layer and top mineral soil. Although deep soil 
may contain some distinctive organisms adapted 
to anoxic conditions or low nutrient levels, our 
sampling was limited to topsoil for the following 
reasons. First, in the vast majority of soil types, 
>50% of microbial biomass and biological activ- 
ity occur in the topmost organic soil layer. Second, 
deeper sampling was impossible in shallow, rocky 


soils or those with high clay concentrations and 
hardpans. Third, differences among soil horizons 
may be masked by other variables across large 
geographic scales (10). The 40 soil cores taken in 
each site were pooled, coarse roots and stones 
were removed, and a subset of the soil was air- 
dried at <35°C. Dried soil was stored in zip-lock 
plastic bags with silica gel in order to minimize 
humidity during transit. In the laboratory, we 
ground dried soil into fine powder using bead
beating.

We extracted DNA from 2.0 g of soil using 
the PowerMax Soil DNA Isolation kit (MoBio, 
Carlsbad, CA) following manufacturer’s instruc- 
tions. We performed polymerase chain reaction 
(PCR) using a mixture of six forward primers (in 
equimolar concentration) analogous to ITS3 and
a degenerate reverse primer analogous to ITS4 
(hereafter referred to as ITS4ngs). We shortened 
and modified forward and reverse primers to 
Fig. 1. Maps of global sampling and interpolated taxonomic richness of all fungi. (A) Map of global
sampling. Circles indicate study sites. (B) Interpolated taxonomic richness of all fungi using IDW
algorithm and accounting for the relationship with mean annual precipitation (based on the best multiple 
regression model). Different colors depict residual OTU richness of all fungi accounting for sequencing 
depth. Warm colors indicate OTU-rich sites, whereas cold colors indicate sites with fewer OTUs. 


completely match >99.5% of all fungi [except
~60% of Tulasnellaceae that exhibit highly divergent
5.8S ribosomal DNA (rDNA) and Microsporidia
that exhibit rearrangements in rDNA]
(table S1). The ITS4ngs primer was tagged with
one of 110 identifiers (multiplex identifiers, 10 to
12 bases) that were modified from those recommended
by Roche (Basel, Switzerland) to differ
by >3 bases, start only with adenosine, and consist
of between 30 and 70% adenosine and thymidine
in order to optimize the adaptor ligation
step. The PCR cocktail consisted of 0.6 μl DNA
extract, 0.5 μl each of the primers (20 pmol),
5 μl 5× HOT FIREPol Blend Master Mix (Solis
Biodyne, Tartu, Estonia), and 13.4 μl double-distilled
water. We carried out PCR in four replicates
using the following thermocycling conditions: an 
initial 15 min at 95°C, followed by 30 cycles at 95°C 
for 30 s, 55°C for 30 s, 72°C for 1 min, and a final 
cycle of 10 min at 72°C. PCR products were 
pooled, and their relative quantity was estimated 


by running 5 μl amplicon DNA on 1% agarose
gel for 15 min. DNA samples yielding no visible
band were reamplified by using 35 cycles in an
effort to obtain sufficient PCR product, whereas
samples with a very strong band were reamplified
with only 25 cycles. It is important to
use as few cycles as possible to minimize chimera
formation and to be able to interpret sequence
abundance in a semiquantitative manner (11).
We used negative (for DNA extraction and PCR)
and positive controls throughout the experiment.
Amplicons were purified with Exonuclease I and
FastAP thermosensitive alkaline phosphatase
enzymes (Thermo Scientific, Pittsburgh, PA). Purified
amplicons were subjected to quantity normalization
with a SequalPrep Normalization
Plate Kit (Invitrogen, Carlsbad, CA) following
manufacturer’s instructions. We divided normalized
amplicons into five pools that were subjected
to 454 adaptor ligation, emulsion PCR, and
454 pyrosequencing by using the GS-FLX+ technology
and Titanium chemistry as implemented
by Beckman Coulter Genomics (Danvers, MA). 
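
The design rules stated above for the multiplex identifiers (10 to 12 bases, starting with adenosine, 30 to 70% adenosine plus thymidine, differing from one another by more than 3 bases) can be written as a small check. This is a minimal sketch and not the authors' procedure: the function names and the three example tags are hypothetical, and it assumes equal-length tags compared by Hamming distance.

```python
from itertools import combinations
from typing import Sequence


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length tags."""
    if len(a) != len(b):
        raise ValueError("this sketch only compares tags of equal length")
    return sum(x != y for x, y in zip(a, b))


def tag_ok(tag: str) -> bool:
    """Single-tag rules: 10-12 bases, starts with A, 30-70% A+T."""
    at_fraction = sum(base in "AT" for base in tag) / len(tag)
    return 10 <= len(tag) <= 12 and tag.startswith("A") and 0.30 <= at_fraction <= 0.70


def tag_set_ok(tags: Sequence[str], min_diff: int = 4) -> bool:
    """Every tag passes the single-tag rules and all pairs differ by >3 bases."""
    return all(tag_ok(t) for t in tags) and all(
        hamming(a, b) >= min_diff for a, b in combinations(tags, 2)
    )


# Hypothetical 10-base tags, invented for illustration only.
example_tags = ["ACGAGTGCGT", "ACGCTCGACA", "AGACGCACTC"]
print(tag_set_ok(example_tags))
```

A pairwise check like this scales quadratically with the number of tags, which is unproblematic for a set of 110 identifiers.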

Bioinformatics 

Pyrosequencing on five half-plates resulted in
2,512,068 reads with a median length of 409 bases.
The sequences were reassigned to samples in
mothur 1.32.2 (www.mothur.org) based on the barcodes
and then trimmed (parameters: minlength =
300; maxambigs = 1; maxhomop = 12; qwindowaverage
= 35; qwindowsize = 50; and bdiffs = 1)
to exclude short and low-quality sequences, resulting
in 2,231,188 high-quality sequences. We
used ITSx 1.0.7 (http://microbiology.se/software/itsx)
to remove the flanking 5.8S and 28S rRNA
genes for optimal resolution of ITS2 clustering
and removal of compromised and nontarget sequences.
As a filter to remove most of the partial
sequences, we retained only sequences >99 base
pairs (bp) in length. Chimera control was exercised
through UCHIME 4.2 (www.drive5.com/uchime).
After these filtering steps, 1,397,679
sequences were retained and further clustered 
at 90.0% and 95.0 to 99.0% sequence similarity 
thresholds (12) as implemented in CD-Hit 4.6.1 
(www.cd-hit.org). Clustering revealed 37,387, 
59,556, 66,785, 77,448, 94,255, and 157,956 taxa 
based on 90.0, 95.0, 96.0, 97.0, 98.0, and 99.0% 
sequence similarity thresholds, respectively. The 
longest sequence of each Operational Taxonomic 
Unit (OTU), based on clustering at 98.0% se- 
quence similarity, was selected as the repre- 
sentative for BLASTn searches (word size = 7; 
penalties: gap = -1; gap extension = -2; and 
match = 1) against the International Nucleotide 
Sequence Databases Collaboration (INSDC; www. 
insdc.org) and UNITE (unite.ut.ee) databases. In 
addition, we ran BLASTn searches against established
reference sequences of all fungi in 99.0%
similarity clusters that include third-party taxonomic
and metadata updates (12) as implemented
in the PlutoF workbench (13). For each query, we 
considered the 10 best-matching references to 
annotate our global sequences as accurately as 
possible. If no reliable taxon name was available, 
we ran manual BLASTn searches against INSDC 
with 500 best-matching sequences as output. We 
typically relied on 90, 85, 80, and 75% sequence 
identity as a criterion for assigning OTUs with 
names of a genus, family, order, or class, respec- 
tively. Sequence identity levels were raised in 
subsets of Sordariomycetes, Leotiomycetes, and 
Eurotiomycetes, because these taxa contain mul- 
tiple genera and families that have unusually con- 
served internal transcribed spacer (ITS) sequences. 
As a rule, we considered e-values of BLASTn search 
results <e⁻⁵⁰ reliable to assign sequences to the
fungal kingdom, whereas those >e⁻²⁰ were considered
“unknown.” E-values between e⁻²⁰ and e⁻⁵⁰
were manually checked against the 10 best matches 
for accurate assignment. We followed INSDC for 
higher-level taxonomy of eukaryotes (14) and the 
Index Fungorum (www.indexfungorum.org) for 
species through class-level taxonomy of fungi. Our 
group of taxonomic experts assigned each fungal 
genus, family, or order to functional categories 
(data file S2). If different functional categories 


1256688-2 28 November 2014 • vol 346 issue 6213 


sciencemag.org SCIENCE 


RESEARCH \ RESEARCH ARTICLE 


were present within a specific genus, we chose 
the dominant group (>75% of species assigned 
to a specific category) or considered its ecology 
unknown (<75% of species assignable to a single
category). All Glomeromycota were considered
to be arbuscular mycorrhizal (AM). Taxa were
considered to be ectomycorrhizal (EcM) if they
best matched any sequences of known EcM lineages
(15) and exhibited sequence length/BLASTn
scores above lineage-specific thresholds. For several
taxonomic groups, we constructed phylogenetic
trees to assess the performance of clustering,
sequence quality of singletons, accuracy of OTU
separation, and taxonomic assignments (fig. S1).
In the course of this project, we provided 10,232 
third-party taxonomic reannotations to INSDC 
sequences to improve subsequent identification 
of fungal sequences and made these available 
through the UNITE database. 
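
Two of the decision rules described above lend themselves to a short illustration: the typical identity cut-offs for naming an OTU (90, 85, 80, and 75% for genus, family, order, and class) and the >75% rule for assigning a genus to a functional category. The sketch below is illustrative only. The function names are hypothetical, treating each cut-off as inclusive is an assumption, and it ignores the raised thresholds the text mentions for some Sordariomycetes, Leotiomycetes, and Eurotiomycetes.

```python
# Identity cut-offs quoted in the text, from finest to coarsest rank.
RANK_THRESHOLDS = [("genus", 90.0), ("family", 85.0), ("order", 80.0), ("class", 75.0)]


def lowest_assignable_rank(identity_percent: float) -> str:
    """Return the finest rank whose identity cut-off is met, else 'unassigned'."""
    for rank, cutoff in RANK_THRESHOLDS:
        if identity_percent >= cutoff:  # inclusive comparison is an assumption
            return rank
    return "unassigned"


def dominant_category(species_per_category: dict) -> str:
    """Pick the category holding >75% of a genus's species, else 'unknown'."""
    total = sum(species_per_category.values())
    if total == 0:
        return "unknown"
    category, count = max(species_per_category.items(), key=lambda kv: kv[1])
    return category if count / total > 0.75 else "unknown"


print(lowest_assignable_rank(87.0))
print(dominant_category({"saprotroph": 8, "plant pathogen": 2}))
```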

Statistical analyses 

Estimates of the mean annual temperature 
(MAT), mean annual precipitation (MAP), soil 
moisture, and soil carbon at 30 arc second reso- 
lution were obtained from the WorldClim data- 
base (www.worldclim.org). Estimates of potential 
evapotranspiration (PET) and net primary produc- 
tivity (NPP) at 30 arc minute resolution were 
obtained from the Atlas of the Biosphere (www. 
sage.wisc.edu/atlas/maps.php). Variation coeffi- 
cients for MAT and MAP were computed based 
on the average monthly values to represent sea- 
sonality of temperature and precipitation. We 
also calculated the difference of MAP to PET in 
order to evaluate the effect of rainfall surplus or 
deficit. On the basis of vegetation type and geo- 
graphical distribution, sites were categorized into 
biogeographic regions and biomes following the 
classification of the World Wildlife Fund
(http://worldwildlife.org) with a few exceptions: 
(i) temperate deciduous forests in the Northern 
and Southern hemispheres were treated sepa- 
rately; (ii) tropical montane forests (>1500 m 
elevation) were separated from the tropical low- 
land moist forests; and (iii) grasslands and shrub- 
lands of all geographic origins were pooled. At 
each site, we also determined the age of vegeta- 
tion, time since the last fire, and EcM plant species 
along with their relative contribution to stand 
basal area. EcM plants are usually conspicuous 
trees or prominent shrubs that are relatively easy 
to identify, and their mycorrhizal status is verifi- 
able in the field by using root excavation and 
microscopy. Complete lists of tree species were 
available for <10% of the sites, so we did not 
directly include plant community composition 
parameters in our analyses. 
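
The two derived climate variables described above, seasonality as a variation coefficient of monthly values and the difference of MAP to PET, are simple to compute. The following is a minimal sketch under stated assumptions: the monthly values and the PET figure are invented, the function names are hypothetical, and the population standard deviation is used because the text does not say which estimator was applied.

```python
from statistics import mean, pstdev


def coefficient_of_variation(monthly_values):
    """Seasonality as SD / mean of the monthly values (population SD assumed)."""
    return pstdev(monthly_values) / mean(monthly_values)


def water_balance(map_mm, pet_mm):
    """Rainfall surplus (positive) or deficit (negative): MAP minus PET."""
    return map_mm - pet_mm


# Hypothetical monthly precipitation (mm) for one site.
monthly_precip = [120, 95, 80, 60, 40, 20, 15, 25, 50, 85, 110, 130]
map_mm = sum(monthly_precip)  # annual total used as MAP in this sketch
print(round(coefficient_of_variation(monthly_precip), 3))
print(water_balance(map_mm, pet_mm=700))  # 700 mm PET is an invented value
```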

Concentrations of N, C, ¹³C/¹²C, and ¹⁵N/¹⁴N
were determined from 1 to 20 mg of soil by 
using GC-combustion coupled to isotope-ratio 
mass spectrometry (16). Concentrations of soil 
calcium, potassium, magnesium, and phospho- 
rus were determined as in (16). Soil pH was mea- 
sured in 1 N KCl solution. 

For analyses of fungal richness, we calculated 
residuals of OTU richness in relation to the square 
root of the number of obtained sequences to 


account for differences in sequencing depth. 
This method outperformed the commonly used
rarefaction to the lowest number of sequences,
which removes most of the data (17). We
also calculated the richness of major class-level 
taxonomic and functional groups (comprising 
>100 OTUs). We excluded outlying samples dom- 
inated by a few OTUs of molds, which are indicative 
of poor sample preservation (relative abundance 
of sequences belonging to Trichocomaceae >5%, 
Mortierellaceae >20%, or Mucoraceae >20% that 
exceeded three times the mean + SD). Although
these samples were fairly homogeneously distrib- 
uted across the world, they had conspicuously 
lower fungal richness. We also excluded samples 
that yielded less than 1200 sequences per sample. 
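
The depth correction described above, taking residuals of OTU richness against the square root of the number of sequences, can be sketched as an ordinary least-squares fit. This is an illustration rather than the authors' code: it assumes a simple linear regression with an intercept, and the function name and sample values are hypothetical.

```python
from math import sqrt


def richness_residuals(otu_counts, read_counts):
    """Residuals of OTU richness after regressing on sqrt(sequencing depth)."""
    x = [sqrt(r) for r in read_counts]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(otu_counts) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, otu_counts))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Positive residual: more OTUs than expected for that sequencing depth.
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, otu_counts)]


# Hypothetical samples: OTU counts and numbers of sequences per sample.
otus = [210, 240, 300, 310]
reads = [1600, 2500, 3600, 4900]
print([round(r, 1) for r in richness_residuals(otus, reads)])
```

Unlike rarefaction, this keeps every sequence in the analysis and only adjusts the richness values being compared.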

To determine the relationship between plant 
and fungal richness, we relied on co-kriging values
from the global vascular plant species richness 
data set (18), which covered 96.7% of our sites. 
These scale-free values of plant richness were 
then regressed with residuals from the best-fit 
models for fungal richness and fungal functional 
groups. We further calculated the ratio of relative 
plant richness to fungal richness and fitted this 
ratio with latitude using polynomial functions to 
test the assumed uniformity of plant-to-fungal 
richness ratios at the global scale (7, 19, 20). To 
account for potential latitudinal biases in plant- 
to-fungal diversity estimates, we took into account 
the nonuniform distribution of land surfaces by 
calculating an inverse distance weighting (IDW) 
spatial interpolation of standardized ratios of plant- 
to-residual fungal diversity using the “gstat” pack- 
age in R (21). We then used IDW to interpolate 
total fungal diversity beyond sampling sites by 
accounting for MAP as based on the best-fitting 
multiple regression model. 
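
Inverse distance weighting estimates a value at an unsampled location as a weighted mean of the sampled values, with weights that decay with distance. The study used the “gstat” package in R; the Python sketch below is a stand-in that shows only the core idea. The function name, the planar (non-geodesic) distance, and the power of 2 are assumptions made for illustration.

```python
from math import hypot


def idw(known, target, power=2.0):
    """Inverse distance weighted estimate at `target`.

    known: iterable of (x, y, value) for sampled sites
    target: (x, y) of the location to interpolate
    """
    numerator = 0.0
    denominator = 0.0
    for x, y, value in known:
        distance = hypot(x - target[0], y - target[1])
        if distance == 0.0:
            return value  # target coincides with a sampled site
        weight = 1.0 / distance ** power
        numerator += weight * value
        denominator += weight
    return numerator / denominator


# Hypothetical sites on a plane with a standardized ratio at each one.
sites = [(0.0, 0.0, 1.0), (2.0, 0.0, 3.0)]
print(idw(sites, (1.0, 0.0)))
```

For global data, great-circle distances would be the more natural choice than the planar distance used here.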

Distance from the equator, altitude, age of 
vegetation, time since last fire, climatic varia- 
bles, and concentrations of nutrients were log- 
transformed before analyses in order to improve 
the distribution of residuals and reduce nonlin- 
earity. To account for potential autocorrelation 
effects, we calculated spatial eigenvectors using 
SAM version 4 (22). To determine the best predictors
of global fungal diversity, we included
edaphic, climatic, floristic, and spatial variables 
in multiple regression models. Because of the 
large number of predictors, we preselected 16 can- 
didate predictors that were revealed through ex- 
ploratory multiple linear and polynomial regression 
analyses based on coefficients of determina- 
tion and forward selection criteria. The most par- 
simonious models were determined according to 
the corrected Akaike information criterion (AICc), 
which penalizes over-fitting. Last, components of 
the best models were forward-selected to deter- 
mine their relative importance as implemented in 
the “packfor” package in R. 
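
The corrected Akaike information criterion adds a small-sample penalty to AIC, so that extra predictors must reduce the residual error substantially to be retained. The sketch below uses the common least-squares form, AICc = n ln(RSS/n) + 2k + 2k(k + 1)/(n − k − 1). Which parameters are counted in k varies between software packages, so that choice, the function name, and the candidate models with their numbers are all assumptions for illustration.

```python
from math import log


def aicc(rss, n, k):
    """Corrected AIC for a least-squares fit.

    rss: residual sum of squares; n: number of sites; k: estimated parameters.
    """
    if n - k - 1 <= 0:
        raise ValueError("AICc needs n > k + 1")
    aic = n * log(rss / n) + 2 * k
    return aic + 2 * k * (k + 1) / (n - k - 1)


# Hypothetical candidate models: name -> (rss, k). The numbers are invented.
candidates = {"MAP + Ca": (120.0, 3), "MAP + Ca + 5 more": (115.0, 8)}
n_sites = 30
best = min(candidates, key=lambda name: aicc(candidates[name][0], n_sites, candidates[name][1]))
print(best)
```

In this invented case the larger model fits slightly better but is penalized for its five additional parameters, so the smaller model has the lower AICc.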

To test the direct effects of climatic variables 
on richness of fungi and their functional groups, 
and indirect climatic effects (via soil nutrients 
and vegetation), we used Structural Equation 
Modeling (SEM) in Amos version 22 (SPSS Software,
Chicago, IL). Model fits were explored based
on both χ² test and root-mean-square error of


approximation (RMSEA). First, we included all 
potentially important variables (inferred from 
both the multiple regression models and corre- 
lations for individual response variables) to con- 
struct separate SEM models. We tested all direct 
and indirect relations between exogenous and 
endogenous variables, including their error terms. 
Then, we used backward elimination to remove 
nonsignificant links to maximize whole-model 
fit. Last, we combined the obtained SEM models 
in a unified path model, following the same elim- 
ination procedure. 

In addition to full models, we specifically tested 
the relationships between OTU richness and dis- 
tance from the equator and soil pH because these 
or closely related variables were usually among 
the most important predictors. For these analy- 
ses, we calculated residuals of richness that ac- 
counted for other significant variables of the 
best models. To address nonlinear relationships, 
we fitted up to fifth-order polynomial functions 
and selected best-fit models on the basis of AICc 
values. 

The relative effects of climatic, edaphic, spatial, 
and floristic variables on the total fungal com- 
munity composition and on particular functional 
groups were determined by using Hellinger dis- 
similarity (calculated if >90% sites were repre- 
sented by >1 shared OTUs), exclusion of all OTUs 
that occurred once, and a multistage model se- 
lection procedure as implemented in the DISTLM 
function of Permanova+ (www.primer-e.com/permanova.htm).
Considering computational
requirements, 15 candidate variables were
preselected based on unifactorial (marginal test
based on largest Fpseudo values) and multifactorial
(forward selection) models. Spatial eigenvectors
were not included in these analyses
because they typically were of minor importance 
in variation partitioning analyses and to avoid 
making the models computationally prohibitive. 
Optimal models were selected based on the AICc. 
To obtain coefficients of determination (cumulative
R²adjusted) and statistics (Fpseudo and P values)
for each variable, components of the best mod- 
els were forward selected. In parallel, we pre- 
pared Global Nonmetric Multidimensional Scaling 
(GNMDS) graphs using the same options. Sig- 
nificant variables were fitted into the GNMDS 
ordination space by using the “envfit” function 
in the “vegan” package of R. We also grouped all 
climatic, edaphic, spatial, and floristic varia- 
bles into a variation partitioning analysis by 
integrating procedures in the “vegan” and “pack- 
for” packages of R. Besides group effects, varia- 
tion partitioning estimates the proportion of shared 
variation among these groups of predictors. 
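
The Hellinger dissimilarity used for the community analyses compares square-root-transformed relative abundances, which down-weights dominant OTUs. The analyses themselves were run in Permanova+ and R; the Python sketch below shows only the dissimilarity between two sites, in the form of a Hellinger transformation followed by Euclidean distance. The function name and the abundance vectors are hypothetical.

```python
from math import sqrt


def hellinger_distance(site_a, site_b):
    """Euclidean distance between square-root relative abundances.

    site_a, site_b: sequence counts per OTU, in the same OTU order.
    The value ranges from 0 (same proportions) to sqrt(2) (no shared OTUs).
    """
    total_a = sum(site_a)
    total_b = sum(site_b)
    return sqrt(
        sum((sqrt(a / total_a) - sqrt(b / total_b)) ** 2 for a, b in zip(site_a, site_b))
    )


# Hypothetical counts for four OTUs at two sites.
print(round(hellinger_distance([10, 20, 0, 5], [2, 0, 30, 8]), 3))
```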

For global biogeographic analyses, we excluded 
OTUs from the order Hypocreales and family 
Trichocomaceae (both Ascomycota) because the 
ITS region provides insufficient taxonomic reso- 
lution, and known biological species are grouped 
together within the same OTU (23). We tested 
the differences among fungal taxonomic and func- 
tional groups for the occurrence frequency (num- 
ber of sites detected) and latitudinal range of 
OTUs using a nonparametric Kruskal-Wallis 
test and Bonferroni-adjusted multiple comparisons
among mean ranks. To test the validity of
Rapoport’s rule in soil fungi, we calculated the
average latitudinal range of OTUs for each site
(24). The average latitudinal range was regressed
against the latitude of study sites by means of
polynomial model selection based on the AICc
criterion. This analysis was run with and with- 
out OTUs only detected at a single site (range = 0). 
Because the results were qualitatively similar, we 
report results including all OTUs. To construct 
biogeographic relationships among major regions 
and biomes, we generated cross-region and cross- 
biome networks based on the number of shared 
OTUs. We excluded occurrences represented by a 
single sequence per site. Ward clustering of biogeographic
regions and biomes was constructed
by using the Morisita-Horn index of similarity,
which is insensitive to differences in sample size,
with the “pvclust” package of R. In this procedure,
P values are inferred for nonterminal
branches based on multiscale bootstrap resam- 
pling with 1000 replicates. 
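
The Morisita-Horn index mentioned above can be written as C = 2 Σ aᵢbᵢ / [(Σ aᵢ²/A² + Σ bᵢ²/B²) · A · B], where aᵢ and bᵢ are the counts of OTU i in the two regions and A and B are the totals. Because it works on proportions, scaling one sample up or down leaves the value unchanged. The sketch below is a plain Python illustration with hypothetical counts and function name, not the clustering workflow itself.

```python
def morisita_horn(counts_a, counts_b):
    """Morisita-Horn similarity between two abundance vectors (0 to 1)."""
    total_a = sum(counts_a)
    total_b = sum(counts_b)
    dominance_a = sum(a * a for a in counts_a) / total_a ** 2
    dominance_b = sum(b * b for b in counts_b) / total_b ** 2
    cross = sum(a * b for a, b in zip(counts_a, counts_b))
    return 2 * cross / ((dominance_a + dominance_b) * total_a * total_b)


# Hypothetical OTU counts: same proportions, tenfold difference in depth.
deep = [10, 20, 30]
shallow = [1, 2, 3]
print(morisita_horn(deep, shallow))
```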

Taxonomic and functional diversity 

Pyrosequencing analysis of global soil samples 
revealed 1,019,514 quality-filtered sequences that 
were separated into 94,255 species-level OTUs
(supplementary materials). Altogether, 963,458 
(94.5%) sequences and 80,486 (85.4%) OTUs were 
classified as Fungi. Most other taxa belonged to 
animals (Metazoa, 3.3%), plants (Viridiplantae, 
3.1%), alveolates (Alveolata, 2.8%), and amoebae 
(mostly Rhizaria, 1.3%). Kingdom-level assign- 


ment of 3.8% OTUs remained elusive. The fungal 
subset included 35,923 (44.6%) OTUs that were 
represented by a single sequence; these were removed
from further analyses in order to avoid
overestimating richness based on these potentially
erroneous sequences (25). The remaining
44,563 nonsingleton fungal OTUs in our data 
set numerically correspond to approximately 
half of the described fungal species on Earth 
(2). For comparison, there are currently 52,481 
OTUs based on 98.0% similarity clustering of 
all fungal ITS sequences in publicly available 
databases (22). Global soil sampling revealed 
representatives of all major phyla and classes 
of Fungi. Of fungal taxa, Basidiomycota (55.7%), 
Ascomycota (31.3%), Mortierellomycotina (6.3%), 
and Mucoromycotina (4.4%) encompassed the 
Fig. 2. Relative proportion of fungal sequences assigned to major taxonomic groups in different biomes.




largest proportion of sequences (Fig. 2), whereas 
the most OTU-rich phyla were the Ascomycota 
(48.7%), Basidiomycota (41.8%), Chytridiomycota
(2.3%), and Cryptomycota (syn. Rozellida; 2.1%)
(fig. S2 and data file S1). Except for the recently
described phylum Cryptomycota (26), the relative
proportions of major phyla correspond to
the proportional distribution of taxa described
and sequenced to date (www.indexfungorum.org)
(12). Below the phylum level, ~6% of all
fungal OTUs could not be assigned to any known 
class of fungi. Further clustering of unidentified 
fungal sequences at 70% sequence similarity re- 
vealed 14 distinct taxonomic groups comprising 
>7 OTUs, suggesting that there are several deeply 
divergent class-level fungal lineages that have 
not yet been described or previously sequenced. 

Our classification revealed that 10,801 (24.2%) 
fungal OTUs exhibited >98% sequence similar- 
ity, and 33.8% exhibited >97% similarity, to pre- 
existing ITS sequences in public databases. This 
is consistent with (19), which reported 48% of OTUs
amplified from Alaskan soils with >97% sim- 
ilarity to any database sequences. In our study, 
only 4353 fungal OTUs (9.8%) were matched to 
sequences from herbarium specimens or fully 
described culture collections at >98.0% sequence 
similarity. Although many type collections are 
yet to be sequenced, the paucity of matches to 
database entries indicates that a majority of 
soil-inhabiting fungal taxa remain undescribed 
(19, 20). These results highlight the current lack
of data from understudied tropical and subtropical
ecosystems. The phenomenon of high
cryptic diversity and low success in naming
OTUs at the genus or species level has been
found in other groups of soil microbes and invertebrates,
emphasizing our poor overall knowledge
of global soil biodiversity (27, 28).

The main fungal phylogenetic and functional 
groups were present in all ecosystems, but their 
relative proportions varied severalfold across 
biomes (Fig. 2 and figs. S2 to S4). The ratio of 
Ascomycota to Basidiomycota OTUs was highest 
in grasslands and shrublands (1.86) and tropical 
dry forests (1.64) but lowest in the temperate 
deciduous forests (0.88). Chytridiomycota, Cryp- 
tomycota, and Glomeromycota were relatively 
more diverse in the grasslands and shrublands, 
accounting for 4.6, 3.6, and 1.4% of OTU rich- 
ness, respectively. The relative OTU richness of 
Mortierellomycotina and Mucoromycotina (in- 
cluding most fast-growing molds but also some 
plant symbionts) peaked in the tundra biome 
(4.8 and 2.7%, respectively), but their abundance 
was lowest in tropical dry forests (1.0 and 0.6%, 
respectively). Archaeorhizomycetes, a recently 
described class of Ascomycetes from a boreal 
forest (29), was most diverse in tropical moist
and montane forests, particularly in northern 
South America and New Guinea. 

Among all fungal taxa, OTUs assigned to 
saprotrophs, EcM mutualists, and plant patho- 
gens comprised 19,540 (43.8%), 10,334 (23.2%), 
and 1770 (4.0%), respectively (fig. S4). Other 
trophic categories contained <1% of remaining 
OTUs. EcM fungi contributed 34.1% of all taxa 


in the northern temperate deciduous forests 
but accounted for a relatively low proportion 
(11.9%) in grasslands and shrublands, reflecting 
the paucity of host plants in these ecosystems.
Similarly, the proportion of EcM fungal taxa was 
lowest in northern South America (8.0%), where 
AM trees often dominate. Plant pathogens were 
relatively more abundant and diverse in lowland 
tropical moist (6.2%) and dry (6.3%) forests. 

Predictors of global richness 

Structural equation models revealed that climate 
has a strong direct effect on plant and
fungal richness and functional groups, and it
also indirectly affects these metrics by altering
edaphic conditions (fig. S5). Both SEM and re- 
gression models suggest that the best predictors 
of diversity differed among phylogenetic and 
functional groups of fungi. Positive effects of 
mean annual precipitation (MAP) and soil cal- 
cium (Ca) concentration were the strongest pre- 
dictors of total fungal diversity, explaining 7.2 
and 8.9% of residual richness, respectively (table 
S2). Richness of EcM fungi responded positively 
to the relative proportion and species richness of 
EcM plants (explaining 18.3 and 8.5% of variance, 
respectively), as well as soil pH (13.0%). EcM host 
species richness (5.9%) and soil pH (20.4%) re- 
mained the strongest predictors in the best mod- 
el for sites with EcM vegetation, accounting for 
>60% of basal area, a critical point above which 
the proportion of EcM plants had no further ef- 
fect on EcM fungal richness. MAP had a strong 
positive effect (14.8%) on richness of saprotrophs. 
Diversity of plant pathogens declined with in- 
creasing distance from the equator (17.8%) and 
soil carbon/nitrogen (C/N) ratio (11.6%). Animal 
parasites responded positively to MAP (20.3%), 
whereas monthly variation of precipitation (MAP 
CV) had a negative impact on richness of myco- 
parasites (fungus-parasitic fungi; 8.2%). Richness 
of the AM Glomeromycota was negatively related 
to the age of vegetation (7.3%) but positively re- 
lated to PET (3.5%) and soil pH (4.3%). Of the 
major taxonomic groups, the richness of Asco- 
mycota in general (18.5%) and that of Archae- 
orhizomycetes (21.7%) were negatively related to 
distance from the equator in best-fit models. 
Climatic variables were the best predictors for 
richness of Mortierellomycotina (MAT, nega- 
tive effect, 26.1%) and the ascomycete classes 
Dothideomycetes (MAT, positive effect, 20.9%), 
Lecanoromycetes (MAT, negative effect, 26.7%), 
Leotiomycetes (MAT, negative effect, 30.1%), 
Orbiliomycetes (MAT, positive effect, 12.8%), 
and Sordariomycetes (MAP, positive effect, 33.4%). 
The richness of Chytridiomycota and the ascomy- 
cete class Pezizomycetes was best explained by a 
positive response to soil pH (8.6 and 40.5%, respec- 
tively). Concentration of soil nutrients or their 
ratio to other nutrients were the strongest pre- 
dictors for OTU richness of Cryptomycota (N con- 
centration, positive effect, 10.1%), Geoglossomycetes 
[N/phosphorus (P) ratio, positive effect, 3.7%], 
Mucoromycotina (C/N ratio, positive effect, 19.0%), 
and Wallemiomycetes (P concentration, negative
effect, 14.9%). The richness of Basidiomycota and 


its class Agaricomycetes were best explained by a 
positive response to soil Ca concentration (13.5 and 
12.8%, respectively). 

Although geographical distance per se had 
negligible effects on richness (Moran’s I = 0.267), 
spatial predictors were included in the best rich- 
ness models of nearly all functional and phyloge- 
netic groups (except Glomeromycota), indicating 
regional- or continental-scale differences in OTU 
richness (Fig. 1B). Compared with other tropical
regions, richness of fungi was conspicuously lower 
in Africa, independent of biome type. These re- 
sults might reflect the relatively lower MAP in 
much of Africa as compared with other tropical 
continents. Alternatively, lower fungal richness 
could be related to the disproportionately strong 
shifts in biomes during the Pleistocene, which 
impoverished the African flora (18).

Among edaphic variables, soil pH and Ca 
concentration were typically the most impor- 
tant predictors of fungal OTU richness. These 
variables positively correlated with fungal richness
at the global scale (F1,335 = 290.7; RPearson =
0.682; P < 0.001). The strong positive influence of
soil Ca concentration on richness of fungi, in 
particular Basidiomycota, is congruent with a 
similar positive relationship found for Ca and 
EcM fungal richness associated with Northern 
Hemisphere Alnus spp. (30). Exchangeable Ca
is important for many physiological processes 
in plants and microorganisms and influences 
the turnover rate of soil organic matter (31). In
soil geochemical processes, pH and Ca concen- 
tration affect each other and thus may have 
both direct and indirect effects on soil biota. 
Fungal functional groups were differentially af- 
fected by pH. Richness of EcM fungi was greatest 
in slightly acidic to neutral soils (fig. S6), whereas 
saprotrophs, especially white rot decomposers, 
were more diverse in moderately to strongly 
acidic soils. Richness of Pezizomycetes peaked 
distinctly in neutral soils. 
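
The correlation statistics quoted in this paragraph (a test statistic of 290.7 with 335 error degrees of freedom and a Pearson correlation of 0.682) can be cross-checked, because for a single-predictor linear relationship F = r² · df / (1 − r²). The sketch below assumes the statistic is an F ratio with 1 and 335 degrees of freedom; the function name is hypothetical.

```python
def f_from_r(r, df_error):
    """F statistic (1 numerator df) implied by a Pearson correlation r."""
    return r * r * df_error / (1.0 - r * r)


# Values quoted in the text: r = 0.682 with 335 error degrees of freedom.
print(round(f_from_r(0.682, 335), 1))
```

The result is about 291, which agrees with the reported 290.7 once rounding of the correlation coefficient to three decimals is taken into account.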

Macroecological patterns

In general agreement with biogeographic pat- 
terns of plants, animals, and foliar endophytic 
fungi (5, 32), the overall richness of soil fungi 
increased toward the equator (Fig. 3A). How- 
ever, major functional and taxonomic groups 
showed dramatic departures from the general 
latitudinal richness patterns (Fig. 3 and fig. S7). 
Namely, diversity of saprotrophic fungi, para- 
sites, and pathogens increased at low latitudes, 
whereas richness of EcM fungi peaked at mid- 
latitudes, especially in temperate forests and 
Mediterranean biomes of the Northern Hem- 
isphere (40° to 60°N) (fig. S8). In contrast, 
saprotrophic fungi had a broad richness peak 
spanning from ~45°S to 25°N. Richness of 
Ascomycota— in particular, that of Archaeorhi- 
zomycetes, Dothideomycetes, Eurotiomycetes, 
Orbiliomycetes, and Sordariomycetes— peaked 
in tropical ecosystems (fig. S7). Conversely, the 
ascomycete classes Lecanoromycetes and Leo- 
tiomycetes, as well as Microbotryomycetes 
(basidiomycete yeasts), Mortierellomycotina, and 
Mucoromycotina increased in diversity toward 
Fig. 3. Relationships between residual richness of fungal taxonomic or functional groups and distance
from the equator. (A) All fungi. (B) EcM fungi. (C) Saprotrophic fungi. (D) Plant pathogens. (E) Animal 
parasites. (F) Mycoparasites. (G) White rot decomposers. (H) Yeasts. Lines indicate best-fitting linear or 
polynomial functions. 


the poles, with no noticeable decline in boreal 
forests and tundra biomes. Agaricomycetes,
Pezizomycetes, and Tremellomycetes exhibited 
distinct richness peaks at mid-latitudes. Richness 
of Agaricomycetes was greater in the Northern 
Hemisphere, whereas that of Microbotryomycetes, 
Tremellomycetes, and Wallemiomycetes peaked
in the Southern Hemisphere temperate ecosys- 
tems (fig. S8). 

All of these phylogenetic groups originated 
>150 million years ago on the supercontinent 


Pangaea (33) and have had sufficient time for 
long-distance dispersal. However, our data sug- 
gest that particular regional biotic or abiotic 
conditions (such as soil pH and favorable cli- 
matic conditions) have likely stimulated evolu- 
tionary radiations in certain geographic areas 
and not in others. Adaptation to cold climate in 
younger fungal phyla has been suggested to ex- 
plain differential latitudinal preferences among 
fungal groups (34). However, our global analysis 
provided no support for this hypothesis (fig. S9). 


Instead, it revealed that ancient lineages are rel- 
atively more common in nonwooded ecosystems. 

Relation of plant and fungal richness 

Plant and fungal richness were positively correlated
(fig. S10), but plant richness explained
no residual richness of fungi according to the
best regression model (R²adj < 0.01; P > 0.05).
These results and SEM path diagrams suggest 
that correlations between plant and fungal rich- 
ness are best explained by their similar response 
to climatic and edaphic variables (covariance) 
rather than by direct effects of plants on fungi. 
However, when separating functional categories, 
trophic groups of fungi exhibited differential 
response to plant diversity and relative propor- 
tion of potential hosts. 

Plant pathogens usually attack a phylogenetically
limited set of host plants (35), suggesting
that plant pathogens have at least partly co- 
evolved with their hosts and may have radiated 
more intensively in the tropics, where high plant 
diversification and richness permit greater diver- 
sification. Strong phylogenetic signals in soil feed- 
backs, adaptive radiation, and negative density 
dependence (the Janzen-Connell hypothesis) have 
probably contributed to the pronounced richness 
of both plants and their pathogens at low lati- 
tudes (36, 37). However, our analyses revealed no 
significant effects of plant richness per se on re- 
sidual richness of pathogens in soil. Similarly to 
pathogens, richness of AM fungi was unrelated 
to the proportion of AM host trees or interpo- 
lated host richness, which may result from non- 
specific associations with tree and understory 
species. Hence, both AM and soil pathogen rich- 
ness were unaffected by plant richness. In con- 
trast, host richness explained 6% of variation in 
EcM fungal richness, indicating either niche dif- 
ferentiation of fungi in forests of mixed hosts or 
sampling effects (forests with higher host diversity
are more likely to include plant species that
harbor high fungal diversity). With a few notable 
exceptions, most studies have found low levels of 
host preference or host specificity among EcM 
fungi (38). We found that relative EcM host 
density had a strong influence on EcM fungal 
richness, suggesting that greater availability of 
colonizable roots in soil provides more carbon 
for EcM fungi and thereby yields greater species 
density and local-scale richness regardless of lati- 
tude. The peak of EcM fungal taxonomic and 
phylogenetic richness in northern temperate 
biomes coincides with the geographical distri- 
bution and dominance of Pinaceae, which is the 
oldest extant EcM plant family (75, 39). 

The ratio of plant-to-fungal richness decreased 
exponentially with increasing latitude because 
plant diversity dropped precipitously toward the 
poles relative to fungal diversity (Fig. 4). This 
finding calls into question present global fungal 
richness estimates. These estimates assume sim- 
ilar spatial turnover of plant and fungal species 
and a constant plant-to-fungus ratio and have 
been formulated mostly based on data from tem- 
perate and boreal ecosystems (7, 19, 20). Yet, 
local-scale beta diversity of both plants and fungi 
Fig. 4. Relationship between standardized plant richness to fungal richness ratio and distance
from the equator. (A) Interpolated values. (B) Polynomial regression. Residuals of fungal richness are 
taken from the best linear regression model accounting for other significant predictors. Warm colors 
indicate high plant-to-fungal richness ratio, whereas cold colors indicate low plant-to-fungal richness. 



differ among temperate and tropical sites (40, 41), 
and there are profound differences in plant spe- 
cies turnover depending on propagule size (42). 
Natural distributions of very few vascular plant
species encompass several continents, but there
are multiple fungal species with circumpolar or 
cosmopolitan distribution (45, 44). Although we 
cannot directly compare plant and fungal beta
diversity, spatial turnover of plant species is in- 
arguably greater (42). Based on the function of 
fungi-to-plant richness ratio to latitude and lati- 
tudinal distribution of land, we calculated that 
fungal richness is overestimated by 1.5- and 2.5-fold 
on the basis of constant temperate (45° latitude) 
and boreal (65° latitude) richness ratios, respectively. 

Because richness estimates are calculated 
based on the frequency of the rarest species, 
the reliability of singleton data calls into question
biologically meaningful extrapolations (71). In 


metabarcoding studies such as ours, sequenc- 
ing errors tend to give rise to singleton sequen- 
ces, and the number of rare artificial taxa grows 
rapidly with increasing sequencing depth (25). 
Therefore, despite the size of our data set, it can- 
not readily be used to produce reliable taxonomic 
richness extrapolations. 

Community ecology 

Variation partitioning analysis revealed that 
climatic, edaphic, and floristic variables (and 
their shared effects) are the strongest predic- 
tors for community composition of all fungi and 
most of their functional groups (fig. S11). However,
the saprotroph community composition
was most strongly explained by purely spatial 
variables. More specifically, PET and soil pH 
explained 2.4 and 1.5%, respectively, of the var- 
iation in total fungal community composition 


(table S3 and fig. S12). PET contributed 3.8, 2.8, 
and 11.7% to community structure of saprotrophs, 
plant pathogens, and yeasts, respectively. Distance 
from the equator (1.3%) and soil pH (0.7%) were 
the strongest predictors of EcM fungal commu- 
nity composition, whereas mean annual temper- 
ature (4.0%) was the strongest predictor for 
animal parasites, and distance from the equa- 
tor (3.5%) was the best predictor for mycopar- 
asites (table S3 and fig. S12). 

These results indicate that both environmen- 
tal and spatial predictors generally have a minor 
influence on species-level composition of fungi at 
the global scale. Nonetheless, the significant global- 
scale pH effect in several groups of fungi is 
consistent with the substantial influence of pH 
on the phylogenetic structure of soil fungal and 
bacterial communities in both local and con- 
tinental scales (27, 45). The relatively stronger 
climatic and edaphic drivers of richness at the 
class and phylum level suggest that phylogenetic 
niche conservatism in fungal lineages is similar 
to cross-biome distribution patterns in vascular 
plants (46) and protists (47). 

Global biogeography 

Consistent with Rapoport’s rule formulated for 
macro-organisms (24) and later applied to marine
bacteria (48), the mean latitudinal range of
fungi strongly increased toward the poles (fig. S13). 
These results also suggest that a greater propor- 
tion of fungi are endemic within tropical rather 
than extratropical ecosystems. 

Major taxonomic and functional groups of 
fungi differed markedly in their distribution 
range (figs. S14 and S15). Animal parasites were
more widely distributed as compared with all other
groups, suggesting that there are many general- 
ist OTUs with global distribution. Saprotrophs 
and plant pathogens had broader distribution 
ranges than EcM and AM root symbionts. Taxa 
belonging to Mortierellomycotina, Mucoromy- 
cotina, Tremellomycetes, and Wallemiomycetes— 
groups that include a large proportion of sapro- 
trophs and parasites that produce exceptionally 
large quantities of aerially dispersed mitospores— 
were generally most widely distributed. Besides 
the AM Glomeromycota, OTUs belonging to the 
ascomycete classes Archaeorhizomycetes, Geo- 
glossomycetes, and Orbiliomycetes were detected 
from the fewest sites. 

The northernmost biogeographic regions (Europe, 
West Asia, East Asia, and North America) had 
the most similar fungal communities as re- 
vealed by shared fungal OTUs (Fig. 5). Accord- 
ing to the Morisita-Horn similarity index, the 
northern and southern temperate regions clus- 
tered together with marginally nonsignificant 
support (P = 0.064) (Fig. 6A). In spite of the large
geographical distance separating them, paleo- 
and neotropical biogeographic regions clustered 
together (P = 0.059). However, biogeographic 
clustering of regions deviated markedly in cer- 
tain functional groups of fungi (Fig. 6). For in- 
stance, EcM fungi in the southern temperate 
and tropical regions had greater similarity as 
compared with northern temperate ecosystems 
Fig. 5. Connectedness of biogeographic regions by shared OTUs of EcM fungi, saprotrophs, and plant pathogens. Blue, EcM fungi; black,
saprotrophs; and red, plant pathogens. The width of lines and diameter of circles are proportional to the square root of the number of connections and
sample size (number of sites), respectively. Numbers in circles indicate the number of OTUs found in each region. OTUs with a single sequence per site 
and OTUs belonging to Hypocreales and Trichocomaceae (in which the ITS region is too conservative for species-level discrimination) were excluded. 


(P = 0.001). Among biomes, boreal forests, tem- 
perate coniferous forests, and temperate decid- 
uous forests shared the largest numbers of fungal 
OTUs (fig. S16). Fungal OTUs in temperate decid- 
uous forests were highly similar to Mediterranean 
and tropical montane forests, whereas fungal 
OTUs in tropical montane forests were linked 
to tropical moist forests, which in turn exhibited 
substantial connections with tropical dry forests 
and savannas. As a result, cluster analysis sup- 
ported separation of tropical and nontropical 
biomes (Fig. 6B). Consistent with biogeographic 
region-level analysis, lowland tropical biomes,
arctic tundra and boreal forests biomes, and 
temperate biomes formed three well-supported 
clusters. Tropical montane forests and grasslands 
and shrublands were clustered with temperate 
biomes according to distribution of all fungi and 
most functional groups. However, in EcM fungi, 
taxa from southern temperate forests, tropical 
montane forests, and grass/shrublands clustered 
with tropical lowland and Mediterranean bio- 
mes. A relatively large proportion of EcM fungal 
taxa were shared across various biomes in 
Australia and New Guinea, which explains these 


deviating patterns. In contrast, plant pathogens 
from tropical montane forests clustered with 
tropical lowland biomes rather than with temperate
biomes.
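
The regional and biome clustering above rests on the Morisita-Horn pairwise similarity index. As a rough illustration only, the index can be sketched for two abundance vectors as follows. The function name `morisita_horn` and the toy OTU counts are assumptions made for this example and are not taken from the study's analysis pipeline.

```python
def morisita_horn(x, y):
    """Morisita-Horn similarity between two communities.

    x and y are sequences of OTU abundances listed in the same OTU order.
    Returns 1.0 for identical relative abundances and 0.0 when no OTU is shared.
    """
    if len(x) != len(y):
        raise ValueError("abundance vectors must have the same length")
    total_x = float(sum(x))
    total_y = float(sum(y))
    if total_x == 0 or total_y == 0:
        raise ValueError("each community needs at least one observation")

    # Simpson-type dominance terms for each community
    dominance_x = sum(a * a for a in x) / total_x ** 2
    dominance_y = sum(b * b for b in y) / total_y ** 2

    # Cross-product term, which is non-zero only for shared OTUs
    shared = sum(a * b for a, b in zip(x, y))
    return 2.0 * shared / ((dominance_x + dominance_y) * total_x * total_y)


# Hypothetical OTU counts for two regions (illustrative numbers only)
region_a = [10, 5, 0, 0]
region_b = [20, 10, 0, 0]   # same proportions as region_a, deeper sampling
region_c = [0, 0, 7, 3]     # no OTUs shared with region_a

print(morisita_horn(region_a, region_b))
print(morisita_horn(region_a, region_c))
```

Because the index works on relative abundances, it is insensitive to differences in sampling depth between the two communities, which is one common reason for choosing it with sequence-count data.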

Our biogeographic analyses complement the 
community-level results, suggesting that both 
climate and biogeographic history shape mac- 
roecological patterns of fungi. Comigration with 
hosts over Pleistocene land bridges (such as 
Beringia, Wallacea, and Panamanian) and long-distance
dispersal by spores appear to have
played important roles in shaping current fungal 
distribution patterns (30, 43). The relative influence
of climate and biotrophic associations
with host plants of varying extant distributions
probably contributes to differences in the range
and biogeographic relationships among fungal
functional groups (49). In addition, taxon-specific 
constraints for dispersal, such as shape and size 
of propagules and sensitivity to ultraviolet light, 
may differentially affect long-distance dispersal 
among taxa (7). For instance, Glomeromycota 
OTUs, which form relatively large non-wind- 
dispersed asexual spores, had the lowest average 
geographical range. In general, region-based dis- 


tribution patterns of fungi are somewhat con- 
flicting with clustering of plants and animals, 
where Holarctic lineages are deeply nested within
larger tropical groups (50). Consistent with
macroorganisms, fungi from the Southern Hem- 
isphere temperate landmasses cluster together. 
Differences observed in macroecological patterns 
among fungi, plants, and animals may originate 
from the relative strength of dispersal limitation 
and phylogeographic history, but exaggeration 
from methodological differences among studies 
cannot be discounted. The use of homogeneous
sampling and analytical methods, as done in this
study, is necessary to confidently compare macroecological
patterns among distinct life forms
and to reliably test degrees of consistency among
all kingdoms of life.

Conclusions and perspectives 

Climatic variables explained the greatest propor- 
tion of richness and community composition in 
fungal groups by exhibiting both direct and
indirect effects through altered soil and floristic
variables. The strong driving climatic forces iden- 
tified here open up concerns regarding the 
Fig. 6. Ward clustering of biogeographic regions and biomes based on the Morisita-Horn pairwise
(E and F) Saprotrophs. (G and H) Plant pathogens. Numbers above branches indicate P values.


impact of climate change on the spread of dis- 
ease (57) and the functional consequences of 
altered soil microorganism communities (52). 
The observed abrupt functional differences be- 
tween fungal communities in forested and tree- 
less ecosystems, despite spatial juxtaposition, 
suggest that plant life form and mycorrhizal
associations determine soil biochemical pro- 
cesses more than plant species per se. Loss of 
tree cover and shrub encroachment resulting from 
drying and warming may thus have a marked 
impact on ecosystem functioning both above- 
and belowground. 

In addition to natural mechanisms, such as 
long-distance dispersal and migration over past 
land bridges, global trade has enhanced the spread 
of some non-native soil organisms into other eco- 
systems, where they sometimes become hazardous 
to native biota, economy, and human health (53).
Our results highlight how little insight we still 


have into natural microbial distribution patterns, 
and this undermines our ability to appraise the 
actual role of humans in shaping these biogeo- 
graphic processes. Even larger-scale sampling 
campaigns are needed to provide data for es- 
tablishing natural distributions and building 
species distribution models (52), which will 
enable us to predict the spread and habitat suit- 
ability of non-native microorganisms. 
Conditional density-based analysis of T cell signaling in single-cell data
INTRODUCTION: Cellular circuits sense
the environment, process signals, and com- 
pute decisions using networks of interact- 
ing proteins. Emerging high-dimensional 
single-cell technologies such as mass cy- 
tometry can measure dozens of protein 
epitopes simultaneously in millions of in- 
dividual cells. With thousands of individ- 
ual cells, each providing a point of data on 
co-occurring protein states, it is possible to 
infer and quantify the functional forms of 
the relationships between proteins. How- 
ever, in practice these underlying relation- 
ships are typically obscured by statistical 
limitations of the data, hence rendering the 
analysis and interpretation of single-cell 
data challenging. We developed computa- 
tional methods, tailored to single-cell data, 


to more completely define the function and 
strength of signaling relationships. 

RATIONALE: We demonstrate the utility 
of our methods using single-cell data col- 
lected from T cells. Although T cell subpop- 
ulations are phenotypically delineated into 
several cell subsets— such as regulatory, 
effector, and memory— and are thought to 
have similarly wired signaling networks, 
their responses to activation differ in ways 
that are not understood. 

RESULTS: We used mass cytometry to mea- 
sure the abundance of 20 internal and sur- 
face protein epitopes, at 13 time points, after 
two different types of T cell receptor (TCR) 
activation in T cells of B6 mice— resulting 



Quantitative characterization of T cell signaling. (A) The pCD3ζ-pSLP76 signaling interaction
shown as (I) a scatterplot, (II) a kernel density estimate, and (III) by using a conditional DREVI
method. (IV) Shape features are extracted and quantified. (B) DREVI plots of a signaling cascade
downstream of TCR show the time-varying nature of edge shapes and strengths. (C) Edge
strengths are quantified by using conditional DREMI.


in more than 2 million data points. To study 
TCR signaling, we developed Conditional- 
Density Resampled Estimate of Mutual Infor- 
mation (DREMI) to quantify the strengths of 
the influence that a protein X has on protein 
Y, and conditional-Density Rescaled Visualization
(DREVI) to visualize and characterize
the edge-response function underlying their
molecular interaction. A key conceptual shift
in DREMI and DREVI is our use of the conditional
probability of Y|X rather than the
joint probability of X and Y. We show that
the consensus Y-response for each value of X
is much easier to identify in the conditional
density estimate, especially when the joint
density is concentrated in a narrow range,
which is typical of such data (Fig. 1A).

We used DREMI to characterize the 
rapid dynamics of signaling interactions 
upon TCR activation (Fig. 1B) and show
that the strength of signal transfer peaks
in canonical pathway order (Fig. 1C). We compared
edges in naive and antigen-exposed
CD4+ T cells and identified differential signal
transmission along a key
signaling cascade that starts at pCD3ζ and
continues through pSLP76, pERK, and pS6. 
At each stage in this cascade, more informa- 
tion (higher DREMI) is transferred down- 
stream from one protein to another, over 
a longer time period, in naive cells than in 
antigen-exposed cells. We validated our char- 
acterization in mice lacking the extracellular- 
regulated mitogen-activated protein kinase 
(ERK2), demonstrating stronger influence of 
pERK on pS6 in naive cells, as predicted. 


ON OUR WEBSITE

Read the full article at http://dx.doi.org/10.1126/science.1250689

CONCLUSION: DREMI solves a challenging 
problem: quantifying the strength of the un- 
derlying complex relationships between pro- 
teins from noisy data. Our approach reveals 
how signaling is fine-tuned between T cell 
subpopulations: The differences we identi- 
fied between naive and antigen-exposed T 
cells suggest that naive cells more sensitively 
transmit upstream signaling inputs along a 
key signaling cascade. In contrast, trained ef- 
fector or memory cells seem poised for fast 
responses upon repeated exposure. 

DREVI and DREMI are broadly appli- 
cable across biological systems and single- 
cell technologies. As single-cell data become 
more abundant, our methods will enable 
the construction of quantitative models of 
cellular signaling and comparison between 
healthy and diseased cells. ■ 
Conditional density-based analysis of
T cell signaling in single-cell data

Smita Krishnaswamy, Matthew H. Spitzer, Michael Mingueneau, Sean C. Bendall,
Oren Litvin, Erica Stone, Dana Pe’er, Garry P. Nolan

Cellular circuits sense the environment, process signals, and compute decisions using 
networks of interacting proteins. To model such a system, the abundance of each activated 
protein species can be described as a stochastic function of the abundance of other 
proteins. High-dimensional single-cell technologies, such as mass cytometry, offer an 
opportunity to characterize signaling circuit-wide. However, the challenge of developing 
and applying computational approaches to interpret such complex data remains. Here, 
we developed computational methods, based on established statistical concepts, to 
characterize signaling network relationships by quantifying the strengths of network edges 
and deriving signaling response functions. In comparing signaling between naive and
antigen-exposed CD4+ T lymphocytes, we find that although these two cell subtypes had
similarly wired networks, naive cells transmitted more information along a key signaling
cascade than did antigen-exposed cells. We validated our characterization on mice
lacking the extracellular-regulated mitogen-activated protein kinase (MAPK) ERK2, which
showed stronger influence of pERK on pS6 (phosphorylated-ribosomal protein S6), in
naive cells as compared with antigen-exposed cells, as predicted. We demonstrate that
by using cell-to-cell variation inherent in single-cell data, we can derive response 
functions underlying molecular circuits and drive the understanding of how cells 
process signals. 


Cells process external cues through the biological
circuitry of signaling networks in
which each protein species processes information
pertaining to other proteins, whose
activities themselves are determined by
biochemical modifications (such as phosphorylation)
or other allosteric interactions. Signaling
networks can be remarkably attuned to distinguishing
subtle features of stimuli to enable key
decisions regarding cellular response or fate. For
example, naive CD4+ T cells take into account
both dose and duration of T cell receptor (TCR)
engagement, the strength of peptide binding in
the major histocompatibility complex (MHC) cleft,
and coreceptor cues in making a decision to 
differentiate into either regulatory or T helper 
cells (1-4). With this example as one among many, 
it follows then that to properly understand normal 
cellular responses and how these are dysregulated 
in disease, robust quantitative characterizations 
of signaling relationships will be required to en- 
able more accurate models of signaling. 


Department of Biological Sciences, Department of Systems
and Immunology, Stanford University, Stanford, CA, USA.
Department of Cellular and Molecular Medicine, University of
Despite progress in the quest to understand
and represent the complexities of signaling biol- 
ogy, graph diagrams typically used as depictions 
of signaling relationships only offer qualitative 
abstractions. In such graphs, the vertices corre- 
spond to proteins, and a directional edge indi- 
cates the influence of one protein or molecular 
species on another, and as such, these graphs fail 
to capture many of the more complex ways 
through which signaling networks process in- 
formation. Further, such representations are not 
designed to readily enable predictions of response 
to stimuli or therapeutic intervention. Although 
quantitative models have been proposed to de- 
scribe signaling networks (5, 5, 6), these are spe- 
cific to each system and require measurements 
of biochemical rates and many additional param- 
eters. To scale to a large number of signaling 
networks and cell types, a robust data-driven approach
that can quantify signaling interactions
in molecular circuits is required. A data-driven 
approach would take advantage of statistically 
relevant differences in complex cell populations 
so as to better inform the function that is en- 
coded by an inferred circuit diagram. 

To this end, single-cell measurement technol- 
ogies can offer quantitatively precise, even ab- 
solute (given appropriate probes and experimental 
design), measures of dozens of cellular components
representing important biochemical functions.
Variation in a complex cell population can
be discerned in a functionally relevant context 
and enable insights into the underlying relation- 


ships between signaling molecules. Mass cytom- 
etry, for example, can assay the abundance of 
dozens of internal and surface protein epitopes 
simultaneously in millions of individual cells 
(7, 8), offering an opportunity to quantitatively 
characterize signaling at circuit-wide scales. Mod- 
eling a signaling network as a computational sys- 
tem, in which each signaling protein computes a 
stochastic function of other proteins, and treat- 
ing each single cell as an example of possible 
input-output enables the recovery of how a signaling
network functions. With many thousands of in- 
dividual cells, each providing a point of data 
about relationships between proteins, we can 
infer the network function. 

However, a major challenge in deciphering 
single-cell signaling data is developing computational
methods that can handle the complexity,
noise (which can be either natural stochasticity 
or actual instrument noise), and bias in the mea- 
surements. First, because cell populations are rare- 
ly homogeneous, different cell subpopulations 
can manifest distinct behaviors— and therefore, 
the relationships between signaling proteins may 
be obscured beneath a mixture of multiple network
states. For example, naive primary B cells
can have weak and stochastic responses to stimuli
so that only a small fraction of the population 
responds (via activation of signaling pathways), 
whereas memory B cells are considered primed 
and even evolved toward a more avid binding of 
antigen. Similarly, naive T cells manifest differ- 
ent kinetics of response to T cell receptor engage- 
ment than do effector T cells. Second, technical 
noise in the measurements can further confound 
the quantification of molecular interactions. Third, 
marker abundance (which often correlates with 
cell size) can lead to biased correlations and hence 
be misinterpreted as an influence between the 
assayed signaling proteins. 

We addressed these challenges by developing 
an algorithm, based on the statistical concepts of 
conditional probability (9) and density estima- 
tion (10), termed “conditional-Density Resam- 
pled Estimate of Mutual Information” (DREMI) 
to quantify the strength of molecular interac- 
tions. Given a relationship between two proteins, 
where X influences Y, DREMI considers the abundance
or activity of protein Y as a stochastic
function of the abundance or activity of protein
X. DREMI uses the variation in a population of
individually measured single cells to quantify the
amount of information transmitted from protein
X to protein Y in the signaling network.

A key conceptual shift compared with previ- 
ous approaches to single-cell analysis is that 
DREMI computes mutual information on the estimated
“conditional probability” of Y | X rather
than the “joint probability” of X and Y (the latter
being the preferred approach in most other
mutual information-based metrics). Joint probability
describes the density of cell states (such as
in a traditional scatter or density plot), whereas
conditional probability describes how the state
of Y varies with different states of X. To explore
such relationships, we couple DREMI to an al- 
gorithm we term “conditional-Density Rescaled 
Visualization” (DREVI). We use DREVI to visually
render the estimated conditional density function
as a rescaled heat map, which allows us to
visualize the function underlying the molecular
interaction in the cell population. 
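
The distinction between joint and conditional density can be made concrete with a small sketch. The code below is a deliberately simplified stand-in and not the published DREMI or DREVI implementation: it assumes a plain equal-width histogram in place of the density estimation described in the text, omits any resampling or noise handling, and the function names `conditional_density` and `conditional_mi` are hypothetical. It normalizes each X-bin separately to obtain an estimate of P(Y | X), then computes mutual information while giving every populated X-bin equal weight, so that a dense cluster of cells in a narrow X range does not dominate the score.

```python
import math


def conditional_density(xs, ys, bins=8):
    """Histogram estimate of P(Y-bin | X-bin).

    Returns a list with one entry per X-bin: a list of probabilities over
    Y-bins, or None if no cell fell into that X-bin.
    """
    x_lo, x_hi = min(xs), max(xs)
    y_lo, y_hi = min(ys), max(ys)

    def bin_index(value, lo, hi):
        if hi == lo:
            return 0
        return min(int((value - lo) / (hi - lo) * bins), bins - 1)

    # counts[i][j] = number of cells in X-bin i and Y-bin j (the joint histogram)
    counts = [[0] * bins for _ in range(bins)]
    for x, y in zip(xs, ys):
        counts[bin_index(x, x_lo, x_hi)][bin_index(y, y_lo, y_hi)] += 1

    # Normalizing each X-bin on its own turns joint counts into P(Y | X)
    cond = []
    for column in counts:
        n = sum(column)
        cond.append([c / n for c in column] if n else None)
    return cond


def conditional_mi(cond):
    """Mutual information in bits, weighting every populated X-bin equally."""
    columns = [c for c in cond if c is not None]
    k = len(columns)
    n_y = len(columns[0])
    # Marginal of Y under a uniform distribution over populated X-bins
    p_y = [sum(c[j] for c in columns) / k for j in range(n_y)]
    mi = 0.0
    for c in columns:
        for j, p in enumerate(c):
            if p > 0:
                mi += (p / k) * math.log2(p / p_y[j])
    return mi


# Toy data: Y fully determined by X versus Y unrelated to X
xs = list(range(80))
print(conditional_mi(conditional_density(xs, xs)))          # strong edge
print(conditional_mi(conditional_density(xs, [5] * 80)))    # no information
```

For a DREVI-like picture, each list in the output of `conditional_density` would be drawn as one column of a heat map, optionally rescaled by its own maximum so that the response shape stays visible in sparsely populated X ranges.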

T cells offer an opportune system within which 
to characterize signaling relationships because 
several well-characterized but subtly distinct 
T cell subsets exist (such as regulatory, effector, 
and memory), each of which has a distinct function,
and yet the signaling distinctions between
them are not well appreciated. For example, in 
naive CD4+ T cells, which have not been exposed
to antigen, the activation of the TCR (dependent 
on the engagement of coreceptors and availabil- 
ity of certain cytokines) leads to differentiation 
into functionally distinct effector cell types (17). 
In response to the same TCR activation, antigen- 
experienced effector or memory T cells tend to 
proliferate and mount responses more rapidly and 
in greater magnitude than naive T cells (12-14).
However, although the wiring diagrams that broad- 
ly describe the basic pathways are partly under- 
stood, the molecular mechanisms that lead to 
these differences are far from fully resolved. 

Using a combination of DREMI and DREVI, 
we modeled signaling in pathways downstream 
of the TCR. We find that the functional form of
interactions between signaling proteins demon- 
strates sigmoidal behaviors that are dynamically 
and systematically altered upon activation and 
with increases in stimulus strength, leading to 
complex but definable outcomes. We applied this 
to a physiological case using mass cytometry to 
track the abundance of 11 phosphoproteins after 
stimulation, ranging from 30 s to several hours, 
of mouse TCR-activation in more than 2 million
individual cells. A comparison of T cell subtypes 
reveals subtle reconfigurations in the strength 
and functional shape of signal transfer between 
previously known pairs of interacting signaling 
molecules, all of which affects the response to 
TCR activation. Our model enabled predicting 
the outcome of a perturbation that quantitatively 
differed between T cell subtypes that we subsequently
verified experimentally. The computational
framework developed here can be generally
applied to model any signaling system for which
appropriate antibodies, binding agents, or surrogate
readouts are available. The approach discerns
signaling behaviors that exist in complex
cell populations and, in a departure from our previous
work (75), uses the inherent stochasticity to
quantify and characterize the underlying signaling
behaviors, demonstrating that computation
can help reveal mechanistic insight from multidimensional
single-cell data sets.

Results 

The dynamics of T cell signaling were characterized 
and compared across mouse T cell subsets. Time-series 
data were collected for phospho-signaling 
proteins in T-lymphocyte populations of B6 mice 
(16) after stimulation of the TCR (by cross-linking 
TCR and the CD28 costimulator with biotinylated 
antibodies). We assayed the abundance of 
nine surface markers and 11 phospho-epitopes, 


among which were key nodes in TCR signaling 
and related pathways (fig. S1 and table S1). Samples 
were collected at 13 time points after TCR 
activation ranging from 30 s to 80 min with two 
different types of stimuli (CD3/CD28 and CD3/ 
CD4/CD28). We measured ~10,000 cells in each 
sample, cumulatively resulting in deep profiling 
of signaling data for more than 2 million indi- 
vidual cells. 

The nine surface markers in our panel distin- 
guish between T cell subsets, including naive 
CD44^low and antigen-experienced CD44^high T cells 
(fig. S2) (17, 18). Overall, with 20 markers per 
individual cell, these studies simultaneously re- 
solve six T cell subsets in the lymph nodes along 
with their signaling responses, thus reducing tech- 
nical limitations inherent in the assay of post- 
sorted T cell subsets. 


Conditional density-rescaled 
visualization of single-cell data 

To analyze these data, we considered each indi- 
vidual cell in our mass cytometry data as an in- 
stance of co-occurring protein states and used 
this information to quantify the signaling rela- 
tionship between the assayed protein parameters. 
We demonstrate our approach using a signaling 
relationship between two measured protein epitopes 
in our data. A primary locus for intracellular 
signaling initiation of the TCR activation 
cascade is a protein scaffold known as the cluster 
of differentiation 3 zeta-chain (CD3ζ). Upon engagement 
with antigen, CD3ζ is phosphorylated 
(to pCD3ζ), which creates a scaffold target for the 
zeta-chain-associated protein kinase 70 (ZAP-70), 
which in turn phosphorylates SLP76 (SH2 domain 
containing leukocyte protein of 76 kD) and 
LAT [the adaptor protein SH2 domain containing 
leukocyte protein and Linker of activated 
T cells, respectively (19)]. 

Fig. 1. Outline of the DREVI method. All panels represent the relationship between pCD3ζ and pSLP76 
in naive T lymphocytes from B6 mice with CD3 and CD28 before stimulation (A, I to D, I) and 30 s after 
stimulation (A, II to D, II). The color bar indicates density, from high to low. (A) Each dot in the scatterplot 
represents the value of a single cell, where the x axis represents the level of pCD3ζ, and the y axis 
represents the level of pSLP76. (B) The same data are shown using kernel density estimation, 
with red corresponding to dense regions and blue corresponding to sparse regions. The x axis is partitioned 
into slices (in practice, we use 256 such slices). (C) A DREVI plot of the same data, renormalizing the 
density estimate to obtain a conditional-density estimate of the abundance of pSLP76 levels given the 
abundance of pCD3ζ levels. Within each column, dark red (maximal color) represents the highest density in 
that slice. A response function (white curve) is fit to the region of highest conditional density. The top bar 
plots the marginal distribution of pSLP76 levels, and the right bar plots the marginal distribution of pS6 
levels. (D) Following curve fitting, the inflection point, upper plateau, and AUC can be computed and 
compared between conditions. 

When the relation between pCD3ζ and pSLP76 
prestimulation and 30 s after stimulation (Fig. 1A) 
is visualized as a scatterplot (a traditional means 
used to analyze flow cytometry data), it is difficult 
to identify any clear characteristics of the data 
other than the range of expression values. Visualizing 
the data by using a density plot (Fig. 1B) (10), 
one observes that the abundance of pCD3ζ and 
pSLP76 is concentrated at a particular value, but 
there is no clear statistical dependency between 
the two molecules, nor does this visualization method 
illuminate the nature of the well-researched 
influence that pCD3ζ has on pSLP76 (20). 

Why do traditional methods of visualizing 
data, such as scatterplots or density estimations, 
fail to reveal clear relationships between molecules? 
In Fig. 1, the majority of cells can be found 
in a narrow range of values for each marker. Therefore, 
the joint density of pCD3ζ and pSLP76 is 
dominated by a narrow set of values and does 
not reveal the overall “functional response”: how 
protein Y changes as a function of the activity of 
protein X. To understand Y as a function of X, we 
have to explore X's full dynamic range and how 
values of Y depend on values of X. 

To visualize and characterize signal transfer 
between proteins X and Y, we developed DREVI 
(Fig. 1, Box 1, and materials and methods), which 
represents in a visual form the stochastic function 
of how X influences Y. By using the natural 
variation in the amount of X and Y from cell to 
cell, we empirically learn the probability density 
P(X,Y) from the data. To characterize the influence 
that X exerts on Y, we shift from the joint 
density and empirically estimate the conditional 
density P(Y|X) (materials and methods) by means 
of a kernel density estimation method based on 
heat diffusion (10, 21). The conditional density 
enables us to ascertain how Y's values change 
with respect to the values of X, regardless of 
where the majority of cells are concentrated in 
the joint distribution. Although statisticians have 
previously developed sophisticated methods to 
directly compute the conditional density (22), 
these perform poorly on our data (fig. S3). 
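
The shift from joint to conditional density can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes a Gaussian-smoothed 2D histogram as a simple stand-in for the heat-diffusion kernel density estimator, and the function name `drevi_grid` and the parameters `n_bins`, `sigma_bins`, and `min_mass` are hypothetical choices made for illustration.

```python
import numpy as np

def drevi_grid(x, y, n_bins=64, sigma_bins=1.5, min_mass=1e-4):
    """Column-rescaled conditional density of y given x on a regular grid.

    Returns (f_star, x_edges, y_edges); f_star[i, j] is x-slice i, y-bin j.
    Assumption: a smoothed histogram replaces the heat-diffusion KDE.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    joint, x_edges, y_edges = np.histogram2d(x, y, bins=n_bins)
    joint /= joint.sum()

    # Separable Gaussian smoothing of the joint density estimate.
    half = int(np.ceil(3 * sigma_bins))
    t = np.arange(-half, half + 1)
    kernel = np.exp(-0.5 * (t / sigma_bins) ** 2)
    kernel /= kernel.sum()
    smooth = np.apply_along_axis(np.convolve, 0, joint, kernel, mode="same")
    smooth = np.apply_along_axis(np.convolve, 1, smooth, kernel, mode="same")

    marginal_x = smooth.sum(axis=1, keepdims=True)       # f(x) for each x-slice
    f_star = np.zeros_like(smooth)
    ok = marginal_x[:, 0] > min_mass                      # skip nearly empty slices
    cond = smooth[ok] / marginal_x[ok]                    # f(y | x)
    f_star[ok] = cond / cond.max(axis=1, keepdims=True)   # rescale by slice maximum
    return f_star, x_edges, y_edges
```

Plotting `f_star.T` as a heat map (for example with an image plot, origin at the lower left) gives a DREVI-like view in which every populated x-slice reaches the maximal color, regardless of how many cells fall in that slice.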


DREVI plots are shown in Fig. 1C that depict 
pSLP76 abundance (Y) when conditioned on 
pCD3ζ (X) abundance. This view of the data reveals 
a distinct difference in the relation between 
pCD3ζ and pSLP76 before and after TCR activation 
that was not apparent in the scatterplots or 
joint distribution. Given an equal amount of pCD3ζ, 
we observe that pSLP76 has a stronger response 
to increasing pCD3ζ levels after TCR activation 
(in Fig. 1C, the increased values of the median 
response are represented by the white line). This 
suggests that after TCR activation, additional 
factors (perhaps ZAP70 or LAT) modulate the 
relationship between pCD3ζ and pSLP76. Similarly, 
DREVI can help clarify the relationship 
between additional protein pairs (fig. S4) and 
visualize how the relationships change through 
time (fig. S5). 

Conditional density-resampled 
estimate of mutual information 

To systematically compare between conditions, 
time points, and cell types, a measure that quan- 
tifies the relationship between two proteins is 
needed. Often, mutual information (MI)-based 
metrics are used to evaluate relationships between 
gene or protein pairs. However, MI is difficult 
to compute on continuous data. To resolve this, 
a first step is to discretize the data. Currently, 
adaptive partitioning (23) is one of the most 
widely used approaches for such discretization 
(24, 25). However, adaptive partitioning assumes 
that denser regions of the data are more impor- 
tant than sparser regions, and therefore, the dense 
regions are partitioned more finely than are sparse 
regions and dominate the resultant mutual- 
information metric. In contrast, we developed a 
measure that allows for sparser populations to be 
accounted for, thus preventing bias against small 
cell populations that could have distinct and in- 
teresting biology that might inform understand- 
ing of the signaling relations in question. 
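
For reference, the standard textbook definition of MI for two discretized variables is given below. It is not quoted from this article; the bin indices i and j and the symbols H(Y) and H(Y|X) for entropy and conditional entropy are notation assumed here.

```latex
I(X;Y) \;=\; \sum_{i}\sum_{j} p(x_i, y_j)\,
      \log \frac{p(x_i, y_j)}{p(x_i)\,p(y_j)}
\;=\; H(Y) - H(Y \mid X)
```

Because each term is weighted by the joint probability p(x_i, y_j), bins holding most of the cells contribute most of the score, which is the bias toward dense regions described above.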

To quantify the strength of the influence protein 
X has on protein Y, we developed DREMI 
(Box 2, Fig. 2, and materials and methods). Like 
MI (26), DREMI is a shape-agnostic measure that 
scores how predictive X is of Y, but, unlike MI, 
it is not symmetric (and therefore might inform 
directionality) and also captures the strength of 
this relationship over all populated regions of the 
dynamic range, regardless of the (often peaked) 
distribution of X in the data. This is achieved by 
computing mutual information on the condi- 
tional density estimate of the data rather than 
the raw data itself. DREMI begins with the con- 
ditional density estimate, computed for DREVI 
(Box 1). The data are then resampled evenly 
through the range of the conditional probability 
density, and MI is computed on the resampled 
data spanning the entire range. 

To understand the differences between MI 
and DREMI, we use the data from Fig. 1. MI 
works well for data that is well distributed across 
the range of X and Y (Fig. 2A, for example); by 
equipartitioning slices of X, the range of Y drastically 
drops within each slice, and therefore, 
knowing the value of X provides substantial 


Box 1. Conditional density-rescaled visualization. 

DREVI reveals the influence of protein X on protein Y. Most methods for visualizing pairwise 
data (such as scatterplots and density contours) depict the joint probability of X and Y. In 
contrast, we reveal signaling behavior along its full dynamic range by visualizing conditional 
probability. The main steps of DREVI are as follows: 

(i) Compute the joint kernel density estimate f(x,y) (10). 

(ii) Compute the marginal density of X, $f(x)$, and the conditional density estimate of Y given X 
as $f(y \mid x) = f(x, y) / f(x)$ on a fine grid of points $G = \{(x_i, y_j),\ 1 \le i \le n,\ 1 \le j \le m\}$ that span the range of X 
and Y (2. 12, 15). 

(iii) Rescale each value of the conditional density estimate by its column-maximum to obtain 
$f^*(y_j \mid x_i) = f(y_j \mid x_i) / \max_k [f(y_k \mid x_i)]$. 

(iv) Visualize $f^*(G)$ as a heatmap, adding side-bars depicting the marginal densities of X and Y. 


Box 2. Conditional density-resampled estimate of mutual information. 

DREMI provides a score for the strength of the influence protein X has on protein Y. In many 
physiological conditions, only a small fraction of the cells have activated protein X in response 
to stimuli, and these active populations have little influence on the mutual information metric. 
DREMI explicitly factors in these populations by computing a score based on the conditional 
distribution of Y | X rather than the joint distribution. DREMI estimates the quantity $I$ given 
by $I(x, y) = \sum_x \sum_y p(y \mid x) \log \frac{p(y \mid x)}{p(y)}$. 

DREMI is computed as follows: 

(i) Begin with the rescaled conditional-density estimate $f^*(G)$ as in DREVI (Box 1), computed 
on the grid of points $G = \{(x_i, y_j),\ 1 \le i \le n,\ 1 \le j \le m\}$. 

(ii) Round values $f^*(x_i, y_j) < \epsilon$ to 0, for user-defined threshold $\epsilon$, to eliminate technical noise 
from the measurements. 

(iii) Resample from G according to the conditional density estimate f(G). 

(iv) Equi-partition the full range of the data and calculate mutual information, using this 
partition as the discretization. 

information on the likely value of Y. However, in 
distributions typical of single-cell data (Fig. 2B), 
MI is dominated by a peak in density (a narrow 
range of X) in which only minor changes in Y are 
observed; thus, the relationship between X and Y 
is obscured. In contrast, DREMI (Fig. 2C) resamples 
from the conditional density estimate and 
equally weighs all regions along the entire range 
of X, as long as there are enough cells to form a 
robust conditional density estimate. Thus, DREMI 
takes into account sparse populations along the 
X axis and factors in the full range. 
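
The resampling idea can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the published algorithm: the conditional density comes from a plain 2D histogram rather than a kernel density estimate, and the names `dremi_sketch` and `mutual_information` as well as the parameters `n_fine`, `n_coarse`, `min_cells`, `eps`, and `n_per_slice` are hypothetical. `n_fine` is assumed to be a multiple of `n_coarse`.

```python
import numpy as np

def mutual_information(counts):
    """Mutual information in bits from a 2D table of counts."""
    if counts.sum() == 0:
        return 0.0
    p = counts / counts.sum()
    px = p.sum(axis=1, keepdims=True)
    py = p.sum(axis=0, keepdims=True)
    nz = p > 0
    return float((p[nz] * np.log2(p[nz] / (px @ py)[nz])).sum())

def dremi_sketch(x, y, n_fine=64, n_coarse=8, min_cells=20,
                 eps=0.05, n_per_slice=500, seed=0):
    """Resample the same number of points from every populated x-slice,
    then compute MI on an equal-width partition of the resampled points."""
    rng = np.random.default_rng(seed)
    counts, _, _ = np.histogram2d(x, y, bins=n_fine)
    table = np.zeros((n_coarse, n_coarse))
    step = n_fine // n_coarse
    for i, row in enumerate(counts):
        if row.sum() < min_cells:
            continue                          # too few cells for a stable estimate
        cond = row / row.sum()                # estimate of P(Y | X in slice i)
        cond[cond / cond.max() < eps] = 0.0   # noise filter on rescaled density
        cond /= cond.sum()
        draws = rng.choice(n_fine, size=n_per_slice, p=cond)
        np.add.at(table[i // step], draws // step, 1)  # coarse bin of each draw
    return mutual_information(table)
```

Because every populated slice contributes `n_per_slice` draws, a sparsely populated but informative range of X carries the same weight as the density peak, which is the property the paragraph above attributes to DREMI.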

DREMI takes advantage of mass cytometry’s 
ability to collect large sample sizes (data for millions 
of cells), facilitating the estimation of the 
conditional distribution in relatively sparse regions. 
For example, with 50,000 cells, an X range 
with only 1% of the data (typically treated as an 
outlier) still contains 500 cells, a sufficient number 
to robustly estimate P(Y | X = x). To further 
ensure a robust estimation of P(Y | X = x), 
DREMI incorporates an automated noise-filtering 
step in which points are eliminated if they have a 
low rescaled conditional density. 

We evaluated the robustness of DREMI to 
noise (fig. S6) and subsampling (fig. S7). As shown 
in fig. S6, DREMI values generally decrease lin- 
early as the SD of the noise increases, and the 
noise-elimination step can be used to adjust sig- 
nal detection in noisy experimental conditions. 
A key strength of DREMI is that it is similarly 
resilient to noise, whether the underlying function 
is linear or sigmoid, making it well suited 
to capture the range of behaviors observed in 
biology. Further, DREMI is a robust measure 
that remains consistent under multiple sub- 
samples of the data (fig. S7). 

Characterization of signaling relations 
by curve-fitting 

In the case of a strong relationship (high DREMI 
score), we can derive a “response function” from 
the conditional density using curve-fitting. Con- 
ditional density is particularly suited for the der- 
ivation of an edge response function because it 
allows for identification of the consensus Y response 
(densest region in DREVI; the deep red in 
the heat maps) for each value of X. We compare 
curve-fitting of the conditional density with directly 
fitting the raw data in order to demonstrate 
the superiority of the former, both in terms of fit 
[root mean squared error (RMSE)] and in terms 
of the interpretability of the parameters (fig. S8). 
For a well-fit curve, we expect most of the data 
points to fall in close proximity to this curve, as 
measured with RMSE. A fit to the conditional 
mean results in a sigmoidal curve that closely 
follows the data points (indicated by low RMSE), 
whereas the raw data best fits a line, but the fit 
is of lower quality (indicated by high RMSE). An 
optimal sigmoidal fit on the raw data results in 
degenerate curves, in which most data points 
reside at a large distance. 

Fitting a curve in this manner allows for a 
parametric description of the relationship between 
the two proteins X and Y. Such a functional 
description of the relationship between X 
and Y enables us to potentially predict the value 
of Y, if the value of X is altered by an intervention 
or drug. To derive the edge-response function, 
we fit points sampled from the conditional den- 
sity to one of three models using regularized re- 
gression: linear, sigmoidal, or double sigmoidal 
(27). If none of the models results in a good fit, the 
points are instead fit with a free-form curve. 
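
A minimal Python sketch of choosing between a linear and a sigmoidal edge-response function is shown below. It departs from the text in ways that should be read as assumptions: it uses ordinary least squares with a grid search instead of regularized regression, it omits the double-sigmoidal and free-form options, and the function names and the tolerance `tol` are hypothetical. The inputs `x` and `y` stand for points taken from the conditional density, for example each x-slice center paired with the y value of highest conditional density.

```python
import numpy as np

def fit_linear(x, y):
    """Least-squares line; returns model name, parameters, and RMSE."""
    slope, intercept = np.polyfit(x, y, 1)
    pred = slope * x + intercept
    return {"model": "linear", "params": (slope, intercept),
            "rmse": float(np.sqrt(np.mean((y - pred) ** 2)))}

def fit_sigmoid(x, y, n_grid=40):
    """Grid search over midpoint x0 and steepness k.

    For fixed (x0, k) the lower plateau and amplitude enter linearly,
    so they are solved exactly by least squares.
    """
    best = None
    span = x.max() - x.min()
    for x0 in np.linspace(x.min(), x.max(), n_grid):
        for k in np.geomspace(1.0 / span, 50.0 / span, n_grid):
            s = 1.0 / (1.0 + np.exp(-k * (x - x0)))
            design = np.column_stack([np.ones_like(x), s])
            (low, amp), *_ = np.linalg.lstsq(design, y, rcond=None)
            rmse = float(np.sqrt(np.mean((y - (low + amp * s)) ** 2)))
            if best is None or rmse < best["rmse"]:
                best = {"model": "sigmoid",
                        "params": (low, low + amp, x0, k),  # low, high, x0, k
                        "rmse": rmse}
    return best

def edge_response(x, y, tol=0.02):
    """Keep the simpler linear model unless the sigmoid lowers RMSE by more than tol."""
    lin, sig = fit_linear(x, y), fit_sigmoid(x, y)
    return sig if sig["rmse"] < lin["rmse"] - tol else lin
```

The returned sigmoid parameters map onto the quantities discussed in the text: the lower and upper plateaus, and the midpoint `x0` as the activation threshold.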

In Fig. 1, we fit DREVI plots of the pCD3ζ-pSLP76 
edge before and after TCR activation to a 
sigmoid curve (Fig. 1D, I and II). Among other 
characteristics, the parameters of the fitted sigmoid 
specify the lower and upper asymptotes that 
correspond to a digital “low” and “high” mode of 
activation for the protein Y, and the inflection 
point (the activation threshold of X at which the 
protein Y transitions from low to high state). 
Comparing the edge before TCR activation 
with after, there is a similar inflection point 
(activation threshold) on the pCD3ζ axis but a 
considerably higher response on the SLP76 axis, 
and a corresponding increase in total activation 
of SLP76 (Fig. 1D). The postactivation state results 
in higher “area under the curve” (AUC), 

Fig. 2 panel titles: (A) Mutual information measures the decrease in uncertainty of Y given X 
(decreased spread of Y in each X-slice). (B) The high local joint density biases mutual information 
assessment (a narrow X-range dominates MI). (C) DREMI resamples Y from conditional density in each 
X-slice to reveal the relationship between X and Y (# cells: N in every slice). 

Fig. 2. Outline of the DREMI method. (A) Simulated data depicting a relationship with high MI. Across 
the entire range of X, Y has a wide range of values (left). After conditioning on X (red lines), the range of Y 
within each X-slice is substantially smaller (right). Thus, knowledge of X's value provides information on Y's 
likely value. (B) The data represent the relationship between pCD3ζ and pSLP76 30 s after stimulation of 
the TCR, as in Fig. 1B, II. Most of the cells are concentrated around a single peak, as shown in the 2D density 
estimate (left). If we resample from this density estimate, samples will fall within a narrow range in which 
there is little change in Y's range. This narrow range dominates the MI metric. (C) Instead, DREMI samples 
from the renormalized conditional density estimate, which covers the entire X-range, so there is the same 
number of cells in each X-slice, as long as there are sufficient cells in the original data. MI is 
calculated over this resampled data. 

which is proportional to the average amount of 
pSLP76 generated per quantity of pCD3ζ. A 
larger AUC implies more signal transfer activity 
so that after stimulation, less pCD3ζ is required 
to induce a response in pSLP76. Thus, when the 
TCR is activated, not only do pCD3ζ amounts 
increase, but this change is also more impactful 
than a similar increase would have been before 
stimulation. This increase in edge strength indicates 
that the pairwise relation between the 
two proteins is boosted by changes in the recruitment 
and localization of additional and potentially 
unknown proteins (the effects of which are 
successfully captured with DREVI and DREMI). 
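
One way to make the AUC comparison concrete is to write the sigmoidal edge-response function explicitly. The four-parameter logistic form below is an assumed parameterization, not one stated in the text; L and U denote the lower and upper plateaus, x_0 the inflection point, k > 0 the steepness, and [x_min, x_max] the range of X over which the curve is evaluated.

```latex
f(x) = L + \frac{U - L}{1 + e^{-k\,(x - x_0)}},
\qquad
\mathrm{AUC} = \int_{x_{\min}}^{x_{\max}} f(x)\,\mathrm{d}x
```

Under this form, with U > L and a fixed integration range, raising the upper plateau U or shifting the inflection point x_0 to the left both increase the AUC, which matches the qualitative behavior described for the postactivation edge.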

Not only do the DREMI and AUC scores in- 
crease after T cell stimulation, but they also en- 
able distinguishing between different strengths 
and forms of stimulus (Fig. 3). Mouse T cells were 
stimulated with either CD3/CD28 or CD3/CD28/ 
CD4, and mass cytometry analysis was under- 
taken as per above. CD4-costimulus, in addition 
to CD3/CD28, is known to boost signaling responses 
in CD4+ T cells by engaging additional 
pathways that reinforce the signal transmission 
(28). When comparing these two forms of stimuli, 
we derive a higher DREMI score under the 
stronger stimulus in all three of the edges shown 
in Fig. 3. The CD3/CD28/CD4 stimulus clearly 
leads to a higher activation (magnitude) and/or a 
lower activation threshold in the edge-response 
functions. This translates to a higher AUC in both 
the sigmoidal (pCD3ζ-pSLP76) (Fig. 3A, IV) and 
linear (pERK-pS6) (Fig. 3C, IV) cases. The pSLP76- 
pERK edge (Fig. 3B) can be interpreted as a 
stochastic “digital” response, with a larger per- 
centage of cells responding upon CD3/CD28/CD4 
stimulus (Fig. 3B, IV) (5). Together, DREMI, 
DREVI, and the edge-response function form a 
powerful toolbox for the analysis of signaling 
interactions. 

Canonically ordered signaling 
transfer in the TCR pathway 

We used DREMI to analyze the dynamics of the 
pCD3ζ-pSLP76-pERK-pS6 signaling cascade downstream 
of the TCR (Fig. 4). The edge strengths and 
shapes were highly dynamic within just a few 
minutes after TCR activation. The DREMI scores 
pinpoint the “peak timing” of signal transduction, 
meaning the time at which the activation 
of protein Y is most sensitive to changes in the activity of 
protein X. The peak timings of all edges in this 
signaling cascade followed the expected canonical 
order (Fig. 4). For example, a pCD3ζ and 
pSLP76 relationship, or edge, was detectable in 
a small population of cells when pCD3ζ is at its 
highest value even before stimulation (Fig. 4, top 
left). However, after TCR activation, pCD3ζ reached 
maximum signal transfer within 30 s and decreased 
soon after. The pSLP76-pERK edge scored high- 
est at 1 min after activation, and the pERK-pS6 
edge peaked at 2 min after TCR activation. Thus, 
the peak response follows the canonical ordering 
of the pathway in a manner analogous to the 
“just-in-time” transcriptional networks of meta- 
bolic pathways in Zaslaver et al. (29). 

The observed dynamics highlight the importance 
of time series data and the impact of signaling 
dynamics on the relationships that can be 
derived from each individual time point. For instance, 
the pSLP76-pERK edge (Fig. 4A) is only 
active in the 1- to 2-min time frame with a strong 


Fig. 3. Comparison between stimulation conditions. The cells are CD4+ naive T lymphocytes of B6 mice. 
(A) (I and II) The pCD3ζ-pSLP76 edge at 30 s after stimulation under both stimulation conditions. 
(III) As stimulation strength increases, DREMI increases. (IV) The inflection threshold moves to the left, 
and the upper plateau of the response curve also increases, resulting in a higher AUC. (B) The pSLP76-pERK 
edge 2 min after stimulation. The blue lines indicate a partitioning between “low” and “high” states of 
pERK. The pie charts in IV show an increase in the percentage of cells that reach the high state at the 
higher dose. (C) The pERK-pS6 edge has a linear edge-response function. Here, increasing stimulation 
strength results in an increase in DREMI, shown in III, and AUC, shown in IV. 

sigmoidal shape. The pSLP76-pERK edge is flat 
at other times. Such signaling dynamics are suggestive 
of the observed negative feedback loop, 
resulting in dephosphorylation of ERK perhaps 
by PTPN22 or SHP-1 (30, 31) because the influence 
of pSLP76 on pERK drastically decreased at 4 min, 
although pSLP76 remained activated at the 4-min 
time point (Fig. 4A, top row, fifth from left). The 
DREMI scores quantify the intuitive visual interpretation 
of the DREVI plots (Fig. 4B). 

This canonical pathway ordering of peak tim- 
ing is demonstrated again under the CD3/CD28 
stimulation (fig. S9). However, we see that under 
the CD3/CD28/CD4 stimulation, signals are trans- 
ferred downstream faster, and the signal transfer 
is sustained for longer periods. For example, the 
pCD3ζ-pSLP76 edge is only activated up until 
2 min under CD3/CD28 stimulation, whereas 
it remains active until 4 min under a CD3/CD28/ 
CD4 stimulatory environment. Additionally, the 
pSLP76-pERK edge activates only after 2 min 
under the CD3/CD28 stimulation but is activated 
within 1 min under CD3/CD28/CD4 stimulation. 
This is in accordance with upstream influences 
that change the nature and speed of the signal- 
ing behavior downstream as reflected in the edges 
more distal to the TCR. This increased activation 
possibly is due to additional signaling pathways 
that converge through other signaling network 
components downstream of the TCR. 

DREMI quantified the strength of each known 
edge in a given network, and the dynamics of 
how the molecular interactions underlying this 
edge changed over time. Together, they enabled 
an elucidation of a classically understood path- 
way in a single experiment while providing subtle 
insights into the timing of signaling relation- 
ships within the network. 

Performance evaluation of DREMI 

To compare the performance of DREMI with 
other approaches that quantify relations typi- 
cally used for this data type, we used the ca- 
nonical timing of signaling edges described in 
the previous section as a basis for comparison. 
We compared these metrics on six edges to eval- 
uate the ability of each method to rank edge 
strengths at various time points after TCR stimu- 
lation. DREMI outperformed Pearson correla- 
tion and recently developed dependency measures 
such as maximal information coefficient (MIC) 
(32) and adaptively partitioned MI (Fig. 5I and 
fig. S10) (23). DREMI identified the peak timing 
of signal transfer in all six test cases. Adaptive 
partitioning missed the peak timing in four of 
the six cases, including two cases in which 
adaptive partitioning picked the prestimulation 
time as the peak, and typically does not perform 
well when the joint density is concentrated in a 
narrow range. In such cases, adaptive partition- 
ing places many partitions in the dense region 
(so that the partitions are often finer than the 
sensitivity of the experimental technology) and 
only a few in the sparse regions. MIC tends to 
pick peaks similarly to adaptive partitioning, 
identifying the correct peak in only one case. 
MIC has a further limitation in that there is very 


little range in the scores, possibly because it aims 
for equitability (33), which is a condition that 
does not hold in these data. Pearson correlation 
missed the peak timing in two cases and typically 
did not perform well for noisy and nonlinear 
relationships. We also compared the performance 
of DREMI in an additional biological system 
using published mass-cytometry data focused 
on human B cells after B cell receptor stimulation, 
as described in (8). Similar to T cell signaling, 
DREMI was able to quantify and rank relationships 
better than metrics typically used for such 
data (fig. S11). 

Last, we compared DREMI with other ap- 
proaches on synthetic data that was designed to 
exemplify features of realistic data (Fig. 5II). We 


compared two weak, low-noise linear relation- 
ships against two strong, nonlinear noisy rela- 
tionships. Specifically, we included a sigmoidal 
relationship and a nonmonotonic quadratic 
function typical of systems that simultaneously 
induce both agonistic and antagonistic pathways 
(34). The weak relationships were characterized 
by only slight changes in the distribution 
of Y compared with its dynamic range, as X in- 
creases (small slope), whereas the stronger non- 
linear relationships have more pronounced 
change in the distribution of Y with changes in 
X. In both examples, the other methods scored 
the weaker linear relationship higher (Fig. 5II). 
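
The behavior of a linear measure on such data can be reproduced with a small Python sketch. The data below are synthetic and generated only for illustration (the slope, noise level, and sample size are arbitrary assumptions, not the values used for Fig. 5II); the sketch covers Pearson correlation only, not MIC or adaptively partitioned MI.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, 5000)

# Weak but noiseless linear dependence: Y barely moves across the range of X.
y_weak = 0.05 * x
# Strong but noisy, nonmonotonic (quadratic) dependence.
y_strong = x ** 2 + rng.normal(0.0, 0.1, x.size)

def pearson(a, b):
    """Pearson correlation coefficient of two 1D arrays."""
    return float(np.corrcoef(a, b)[0, 1])

r_weak = pearson(x, y_weak)
r_strong = pearson(x, y_strong)
print(f"weak linear:      r = {r_weak:.3f}")
print(f"strong quadratic: r = {r_strong:.3f}")
```

Pearson correlation scores the noiseless line at essentially 1 however small its slope, while the symmetric quadratic scores near 0 even though Y depends strongly on X, which is the ranking failure described above for the symmetric measures.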

A key advantage of single-cell technologies 
such as mass cytometry and flow cytometry is 
the provision of millions of data points in each 
experiment. Therefore, the scalability of the 
analysis is imperative to effectively analyze such 
data. DREMI is faster and more scalable than 
either MIC or adaptive partitioning (fig. S12). On 
a data set with 500,000 cells, DREMI analysis 
can be computed within 10 s. MI with adaptive 
partitioning takes over 90 min to perform the 
same computation, and MIC is limited to data 
sets of less than 10,000 cells. Thus, DREMI is a 
versatile metric that can handle a variety of characteristics, 
including large sample size, nonlinearity, 
and different degrees of stochasticity. 

Fig. 4. DREVI and DREMI reveal the dynamics of TCR signaling. The pCD3ζ-pSLP76-pERK-pS6 
cascade at various time points after TCR activation. All panels represent data from naive T 
lymphocytes from B6 mice with CD3, CD28, and CD4 cross-linking. (A) Each row represents an edge in the 
cascade, and each column represents a time point after TCR activation. DREVI plots show the dynamics of 
TCR signaling; edges change their strength and shape as signaling proceeds. The scale used is the same as 
in Fig. 1, with dark red representing the behavior of the majority of cells, whereas the other colors depict the 
heterogeneity and spread of the response. (B) Network diagrams with DREMI values indicated on each 
edge, columns matching the same time points as in (A). For each edge, the color indicates the computed 
DREMI values, with darker purple indicating a higher DREMI score. DREMI captures peak signal transfer 
timings in each of the three edges. 

Fig. 5 panel title: I. Comparison on mass cytometry T cell data. 

Fig. 5. Comparison of DREMI with three other metrics. The metrics 
compared are mutual information using adaptive partitioning (23), MIC (32), 
and Pearson correlation. (I) Comparisons are performed on edges in naive CD4+ 
T cells under CD3 and CD28 stimulation. In each case, the green outline indicates 
the point with the strongest relationship upon visual inspection of the DREVI plot. 
(A) The pCD3ζ-pSLP76 edge. Only DREMI identifies the poststimulation 0.5 min 
time point as having a stronger signaling relationship than the prestimulation 
data. (B) The pMAPKAPKII-IκBα relationship as an example of a negative relationship. 
DREMI successfully identifies this negative relationship, whereas other 
methods are unable to detect it. Four additional edges are compared in fig. S10. 
(II) Comparison of dependency metrics on synthetic data. Both parts (A) and 
(B) show comparisons between a noiseless but weak dependence of Y on X, 
compared with a noisy but strong dependence of Y on X, (A) sigmoidal and (B) 
quadratic relationships. The other three metrics score the linear, weaker dependence 
higher than the noisier but stronger dependence. Because single-cell 
signaling data are often noisy and nonlinear, DREMI is better suited. 

Unlike the other metrics, DREMI is a directional 
metric; DREMI(X, Y) is not necessarily 
equal to DREMI(Y, X) (fig. S13). We based our 
analysis on the known network diagram and 
scored DREMI in the causal direction of influence. 
Evaluating DREMI in the reverse direction 
shows a slight degradation of performance compared 
with DREMI in the correct direction (figs. 
S10, S11, and S14). However, reverse DREMI outperforms 
the three symmetric measures used in 
our comparison and has the advantage that it 
can handle nonlinearity and noise (fig. S14). The 
correct causal direction scores approximately equal 
or higher in three of the four edges explored at 
peak signaling times (fig. S13). The exception was 
pERK-pS6, where pS6 is known to have many 
additional factors influencing its activity, a recurring 
situation in cases in which DREMI in the 
reverse-causal direction scores higher. We show 
that increasing the number of parents decreases 
the DREMI in the causal direction more than in 
the reverse direction both in linear and nonlinear 
functions (fig. S15). 

Fig. 6. Comparison of naive with antigen-exposed T cells. Data from CD4+ 
naive (CD44^low) and effector or memory (antigen-exposed, CD44^high) T 
lymphocytes from B6 mice with CD3, CD28, and CD4 cross-linking. (A and B) 
We compare the pCD3ζ-pSLP76-pERK-pS6 cascade between the naive and 
effector or memory CD4+ T cells using (A) DREVI and (B) DREMI. The time 
points in (A) match the indicated time points in (B). Both depictions show that 
signal transmission is sharper and more sustained in naive cells. (C) Bar graph 
depicts the higher peak-DREMI detected in naive cells; DREVI plots with edge 
response functions show alterations in edge behavior during peak signaling. The 
pCD3ζ-pSLP76 edge response function shows an earlier inflection point in 
effector or memory cells and is relatively flat after induction. This suggests only 
a small amount of pCD3ζ is needed to initiate a complete response. In contrast, 
pSLP76 needs more pCD3ζ for a similar response in naive cells. The pSLP76- 
pERK edge indicates that the pERK response is weaker and requires more 
pSLP76 in these cells. Last, the pERK-pS6 edge shows a steeper slope in naive 
cells, indicating that pS6 responds more strongly to pERK levels in naive cells. 

Comparison of signaling interactions 
between T cell subtypes 

The high dimensionality of mass cytometry enabled 
us to simultaneously treat and measure 
multiple T cell subsets, including both naive and 
antigen-exposed CD4+ T cells. It is well documented 
that naive T cells behave differently than 
cells that have previously seen and “remember” 
antigen. In naive CD4+ T cells, the activation of 
the TCR and appropriate coreceptors under the 
right cytokine environment leads to differentia- 
tion toward different effector cell fates (11). In 
response to the same TCR activation, antigen- 
experienced effector or memory T cells pro- 
liferate and respond to the infection (12-14). 
Moreover, the activation response time is known 
to be faster and of greater magnitude in effec- 
tor T cells than in naive T cells. However, the 
signaling mechanisms that create these differ- 


ences are not yet clear. The data set collected 
here enabled direct comparison of signaling 
responses in T cells before and after exposure to 
antigen because the cell subsets are in the same 
experimental tube and are resolved after measure- 
ment by means of gating techniques (such as the 
differential expression of the surface marker 
CD44). 

We used DREMI and DREVI to analyze differences
along the signaling cascade that starts at
pCD3ζ and continues down through pSLP76,
pERK, and pS6. Naive and antigen-exposed T
cells had three main differences in their responses
to TCR cross-linking. First, we found that along
each edge in this pathway, naive cells had higher
peak-DREMI, indicating that signals were transferred
more faithfully from the upstream molecule
to the downstream molecule (Fig. 6, A to C).
Second, we found that naive cells had more sustained
periods of high-DREMI (Fig. 6B), which
indicated that signal transduction events lasted
longer. Third, despite the more faithful signal
transfer in naive T cells,
the levels of most of the measured phosphoproteins
along this cascade, except pCD3ζ, were
observed to be higher (constitutively, starting at
time 0) in antigen-exposed cells (Fig. 6D).

The peak DREMI score (Fig. 6C) in either the
naive or the effector cascades occurs at 30 s for
the pCD3ζ-pSLP76 edge, at 1 min for the pSLP76-pERK
edge, and at 2 min for the pERK-pS6 edge.
All three of these were substantially higher in
naive cells than in effector-memory cells. Further,
the pCD3ζ-pSLP76 relationship sustained signal
transmission (high DREMI) for 2 min in naive
cells, compared with only 30 s in antigen-exposed
cells. Similarly, the pSLP76-pERK edge had high
DREMI between 1 and 3 min in naive cells,
whereas DREMI was only high between 1 and
2 min for antigen-exposed cells. Thus, at each
stage in this cascade, Y transmitted more information
about X, over a longer time period, in
naive cells than in antigen-exposed cells, suggesting
tighter control in naive cells. This difference
between naive and antigen-exposed cells was
consistently observed across multiple replicates
(fig. S16).

Although edge strength was higher in naive
cells, pSLP76 levels were higher in antigen-exposed
cells independent of pCD3ζ levels (Fig. 7A).
The near-saturation of the pSLP76 levels can
also be seen in the high and flat slope of the pCD3ζ-pSLP76
edge at 30 s (Fig. 6C). Similarly, pERK
amounts were more faithfully transmitted to pS6
in naive cells, whereas pS6 amounts were basally
higher, independent of pERK, in antigen-exposed
cells (Fig. 7A). In the case of TCR activation with
CD4 costimulation, the analysis suggests that the
AKT pathway might explain the higher level of
pS6 in antigen-exposed cells. The AKT-pS6 edge
had higher DREMI in antigen-exposed cells than
in naive cells (Fig. 7, B and C, and fig. S17) at early
time points. Moreover, the AKT-pS6 relation was
active before stimulation in antigen-exposed cells,
providing a constitutive boost in signaling, perhaps
through positive feedback between pS6 and
pAKT (35).

Fig. 7. Comparison of naive with antigen-exposed T cells on
the pAKT pathway. Data from CD4+ naive (CD44^low) and
effector or memory (antigen-exposed, CD44^high) T lymphocytes
from B6 mice with CD3, CD28, and CD4 cross-linking. (A) Comparison
of phosphoprotein abundances between naive CD4+ T
cells and effector or memory CD4+ T cells. Red indicates that the
phosphoprotein amounts are higher in naive CD4+ T cells,
and blue indicates that the phosphoprotein amounts are higher in
effector or memory CD4+ T cells, demonstrating that differences
in the average behavior of nodes (protein species) are not necessarily
telling of the differences in edges (relationships between proteins).
(B) DREVI plots comparing naive and effector or memory cells on the
pAKT-pS6 edge from 0 to 3 min after stimulation. (C) Network diagrams
depicting DREMI scores in naive and effector or memory cells, showing the
combined pCD3ζ-pSLP76-pERK-pS6 and pCD3ζ-pAKT-pS6 pathways. The prestimulation pAKT-pS6 edge is stronger in

Together, these differences suggest that antigen-exposed
cells are poised toward a more easily
triggered response (12-14, 36). Because the basal
levels of the phosphoproteins are higher, a
shorter period of signal integration (as indicated
by less sustained periods of high-DREMI) would
be needed to elicit a cellular response. On the
other hand, naive cells, which are experiencing
antigen for the first time, may need to mount a
more cautious response, with the signaling cascade
transmitting information carefully for longer
periods of time to avoid spurious activation
(12-14, 36).


ERK-knockout validation 

Because DREMI analysis determined that the
influence of pERK on pS6 is stronger in naive
cells, this predicts that knockout of ERK2 should
have a larger impact on pS6 levels in naive cells
(Fig. 8A), and this difference should be detected
at the time the DREMI of this edge increases,
2 min after stimulation. We used the edge-response
function and the measured difference
in pERK abundance to approximate the expected
change in pS6 levels at 2 min after knockout, with
an expectation to observe a change that is ~50
to 60% larger in naive cells relative to effector/memory
cells at 2 min (Fig. 8, B and C). To validate
these predictions, we collected data from
a mouse model (37), which enables tamoxifen-induced
knockout of ERK2 within a normally
differentiated T cell population. To allow for direct
comparison, immune cells from both the B6
and ERK knockout mice were stimulated and measured
in the same tube and subsequently identified
through congenic markers CD45.1 and CD45.2.

Fig. 8. Validation of differences in edge-strength using an inducible
Erk2-knockout mouse. Data are from CD4+ naive T lymphocytes from both
B6 and Erk2-knockout mice with CD3 and CD28 cross-linking. (A) DREMI is
used to compare the strength of the pERK-pS6 relationship between naive
and effector or memory T cells in B6 mice. Higher DREMI in naive T cells
predicts a bigger impact of Erk2 knockout on pS6 in naive cells.
(B) Heatmaps show the amounts of pERK and pS6 in the control and knockout
mice cells. (C) Edge-response functions for B6 mice at the 2-min time
point predict the outcome of a knockout experiment in (I) naive cells and
(II) effector or memory cells. The x axes of both figures are annotated
with the measured change in mean pERK levels upon knockout, and the y axis
shows the predicted decrease in pS6 levels based on the edge-response
function. (D) I and II show two replicates demonstrating the change in
pS6 amounts after ERK2 knockout. The blue bars represent the average
difference between pS6 in normal and ERK2-knockout mice cells in naive
T cells. The maroon bars show the same difference in effector or memory
cells. The blue bars are significantly higher than the maroon bars,
especially after the 2-min time point (as predicted); therefore, our
edge strength assessment

To test the hypothesis that naive T cells showed
a stronger relationship between pERK and pS6,
based on the relatively higher DREMI in naive
cells, we assessed the impact of the Erk2 depletion
on pS6 levels in both naive and antigen-exposed
cells. As predicted, changes in pS6 abundance were
larger in naive cells than in antigen-exposed cells.
An approximately 50% greater response in naive cells
began at 2 min, as predicted (Fig. 8D). DREMI
also predicts that the influence of pERK on pS6 is
stronger in CD4+ T cells than in CD8+ T cells (fig.
S18), which was also confirmed experimentally
(fig. S19), demonstrating that DREMI can correctly
predict differences in signaling edge strengths
between related T cell subsets.

Discussion 

DREVI and DREMI combined with single-cell
data provide solutions to a challenging problem
that has obstructed the analysis of single-cell
biology: that of quantifying the strength of the
underlying complex relationships between proteins
from data. Other approaches such as correlation,
mutual information, and MIC, which
are typically used to quantify the strengths of
interactions, are not always reliable in the face
of technical noise, natural stochasticity, bimodality,
spurious correlations, and nonlinearity inherent in
single-cell data. DREMI, on the other hand, consistently
quantifies the relative strengths of relationships
and how they change after stimulation
through time and between different cell types.

We used DREVI and DREMI to functionally
and quantitatively analyze edges within intracellular
protein networks. DREVI and DREMI
directly regard the single-cell population as instantiations
of an underlying signal processing
circuit and infer the form and strength of each
edge in the circuit. Many DREVI plots have a
sigmoid (or partial sigmoid) response curve, suggestive
of the signaling kinetics. With absolute
quantification of cytometry data, it would be possible
to derive biochemical rate constants for
multiple enzymatic pairs and multiple subsets in
a single experiment. This would provide an enormous
advantage over the laborious experiments currently
needed to compute such rate constants.
Therefore, a versatile, data-driven approach such
as that presented here enables the systematic
and quantitative comparison of signaling relationships
between different stimuli, time points,
and cell types.

The analysis in this manuscript focused on
mass cytometry data collected from mouse T cells
measuring 11 phosphoproteins and nine surface
markers. The analysis determined that the TCR
signaling network is dynamic, with edges activating
and deactivating quickly as the signals are
transduced downstream. We used DREMI and
DREVI to gain insights into how signaling is fine-tuned
between related T cell populations. The
differences we identified in TCR signaling between
naive and antigen-exposed T cells suggest
that naive cells more sensitively transmit upstream
signaling inputs along a central signaling
cascade. This more fine-tuned signaling response
might help regulate the cell’s decision to differentiate
into either regulatory or helper T cells. In
contrast, entrained effector or memory cells seem
poised for fast responses upon secondary exposures.
To explain this observation, we noted that
several alternative pathways had higher activity
in antigen-exposed cells, including AKT under
CD4 costimulation, which likely bolsters phosphoprotein
levels and shortens input integration time
in antigen-exposed cells.

Cumulatively, these techniques can allow the
tracking of subtle reconfigurations in signaling
that occur between related cell subtypes. These
reconfigurations do not result in an essentially
rewired network, but rather a network that is
behaviorally reconfigured to produce changes in
response. Although we have presented a specific
application of our techniques, focused on TCR
activation in T cell subsets, we believe that DREVI
and DREMI are broadly applicable across biological
systems and single-cell technologies (38, 39).
Our ability to characterize signaling edges and
their strengths is of particular value for disease
studies because it has been shown that many
disease alleles affect the edge strengths of the
signaling network (40). In (Iff), we report the direct 
application of these methods to identify and char- 
acterize the impact of complex genetic variation 
associated with type-1 diabetes on TCR signaling 
networks. As single-cell data become more abun- 
dant, our methods will allow construction of 
more quantitative models of cellular signaling. 
Such models can then elucidate how these sig- 
naling circuits differ between cell types as well as 
characterize essential and therapy-targetable differ- 
ences between health and disease. 

Materials and methods 
Mice 

Mice were maintained in specific pathogen-free
facilities at Harvard Medical School (Institutional
Animal Care and Use Committee 99-20, 02954).
Age-matched, 4- to 6-week-old male B6g7 mice
were used for all experiments. For TCR signaling
responses in peripheral T cells from TCR transgenic
mice, BDC2.5 and BDC2.5/B6g7.Rag−/− mice
were used. This mouse strain was previously
described (41).

Cell stimulation 

For analysis of phosphorylation events by means
of mass cytometry, total lymph node cell suspensions
from B6g7 mice were used. Cells were stimulated
in medium containing 2 or 6 μg/ml of biotinylated
stimulatory antibodies to CD3ε and CD28
and incubated for 2 min at 37°C before addition
of 8 or 24 μg/ml streptavidin. At various times
after cross-linking, the stimulation was stopped
by addition of paraformaldehyde (final concentration
2%) and incubation at room temperature
for 20 min. In some experiments, biotinylated
antibodies to CD4 were also included in the
stimulation mix. The cells were measured before
stimulation and at various time points after stimulation:
30 s and 1, 2, 3, 4, 6, 8, 10, 20, and 80 min.

Erk knockout experiment 

To study the role of ERK in peripheral T cells, we
used an inducible model that was developed in
(37) with the Erk2 floxed gene and ER-CRE,
which is a tamoxifen-inducible adenosine 3',5'-monophosphate
(cAMP) response element (CRE)-recombinase.
Adult mice are injected with tamoxifen
(Sigma-Aldrich, St. Louis, MO) every day for 6 consecutive
days to induce the deletion of the floxed
gene and create ERK2-deficient cells. The mice
were analyzed 2 days later. There are normal numbers
of peripheral T cells in these mice because
they are analyzed only 2 days after the tamoxifen-induced
deletion. Therefore, these peripheral
cells are a result of normal differentiation and
transportation from the thymus weeks before the
tamoxifen injections (37) but have ERK2
knocked down for the TCR activation experiment.

Normal B6 and Erk knockout cells express 
different congenic CD45 surface antigens (Erk- 
knockout mice express CD45.1, and normal B6 
express CD45.2), allowing for the stimulation of 
both cell types in the same tube. B6 and Erk 
knockout cells are therefore mixed before stim- 
ulation and identified after analysis with anti- 
bodies against CD45.1 and CD45.2. This strategy 
has the considerable advantage of eliminating 
any contribution of experimental variations dur- 
ing the stimulation, staining, and CyTOF process- 
ing of the samples (for each time point analyzed). 

Measurement of phospho-signaling 
events with mass cytometry 

In our experiments, we used mass cytometry to
measure surface and internal proteins. Primary
conjugates of the mass cytometry antibodies (complete
list is provided in table S1) were prepared
and titrated as described (8). Paraformaldehyde
(PFA)-fixed and frozen lymphocyte suspensions
were thawed at room temperature and then transferred
to ice.

Single samples of 1 million cells were stained
in a 50 μl final volume. All samples from a given
time series of a stimulation condition were run
in a single barcoded tube. Each barcoded tube
contained 20 individual samples and was stained
in 300 μl. Cells were stained with a cocktail of
metal isotope-conjugated antibodies (table S1) by
using a two-step procedure (surface stains included
CD4, CD8, TCRb, CD5, CD19, CD25, CD44,
CD45.1, and CD45.2; intracellular stains included
pCD3ζ, pSLP76, pERK1/2, pS6, pCREB, pMAPKAPKII,
IkBa, pNFkB, pRB, pFAK, and pAKT).
The signaling network consisting of all markers
is shown in fig. S1. Cell fixation, permeation, and
staining were performed as previously described
(8). To make all samples maximally comparable,
all data were acquired by using internal metal
isotope bead standards as previously described
in (42). Approximately 100,000 events were acquired
per sample on a CyTOF I (DVS Sciences, Sunnyvale,
CA) by using instrument settings previously described
in (42).

Barcoding of samples 

To reduce sample-to-sample variability, cells corresponding
to each stimulation condition and
time point were barcoded by using all six stable
palladium isotopes with a method similar to that
described by (43). In the barcoding scheme, each
isotope is used in binary fashion, high for “1” and
low for “0.” Hence, the barcode itself is a binary
string that can be decoded for each cell to determine
the treatment condition for that cell. For
this purpose, we developed an error-correcting
3-one barcoding scheme that both facilitates
ease of debarcoding and eliminates the majority
of doublets. All legal barcodes in our scheme
have exactly 3 ones and 3 zeroes, resulting in 20
possible barcodes. This barcoding scheme also
enables the detection of most cell doublets because
in a well-mixed tube, it is unlikely to have a
doublet created by two cells from the same condition.
Doublets with cells from two different conditions
would have an illegal combined barcode
(with more than 3 “1”s) that is represented by the
logical OR of the individual barcodes. Therefore,
this scheme eliminates almost all doublet events.
Further, each pair of barcodes has a distance of at
least 2, and therefore, this scheme is also robust
to errors during the debarcoding.
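
The combinatorial claims above can be checked with a short sketch. This is only an illustration of the 3-of-6 scheme as described in the text, not the authors' debarcoding software; the function names are hypothetical, and "distance" is assumed to mean Hamming distance.

```python
from itertools import combinations

N_ISOTOPES = 6  # six stable palladium isotopes
N_ONES = 3      # every legal barcode has exactly three "1"s


def legal_barcodes():
    """Enumerate all 6-bit barcodes with exactly three ones."""
    codes = []
    for ones in combinations(range(N_ISOTOPES), N_ONES):
        codes.append(tuple(1 if i in ones else 0 for i in range(N_ISOTOPES)))
    return codes


def is_legal(code):
    """A measured barcode is accepted only if it has exactly three ones."""
    return len(code) == N_ISOTOPES and sum(code) == N_ONES


def doublet_barcode(a, b):
    """Barcode observed for a doublet: bitwise logical OR of both cells."""
    return tuple(x | y for x, y in zip(a, b))


def hamming(a, b):
    """Number of positions at which two barcodes differ (assumed metric)."""
    return sum(x != y for x, y in zip(a, b))


codes = legal_barcodes()
# Smallest distance between any two distinct legal barcodes.
min_distance = min(hamming(a, b) for a, b in combinations(codes, 2))
# Every doublet of two different conditions yields an illegal barcode.
all_doublets_illegal = all(
    not is_legal(doublet_barcode(a, b)) for a, b in combinations(codes, 2)
)
```

Because two distinct legal barcodes always differ in at least one "1" position, their logical OR carries at least four ones, which is why such doublets can be rejected.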

Post-processing of mass cytometry data 

Individual time series were normalized to the
internal bead standards before analysis by using
the method described in (42). In addition, as described
in (8), for all analysis of the mass cytometry
data, abundance values reported by the mass
cytometer were transformed by using a scaled
arcsinh, with scaling factor 5, to minimize the noise
in measurements where the data are close to zero
(the limit of detection on the mass cytometry scale).
Events recorded by the mass cytometer were
gated based on cell length and DNA content (to
avoid debris and doublets), as described in (8).
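
As a minimal sketch of the transform mentioned above: the text gives only "scaled arcsinh, with scaling factor 5", so the code assumes the common convention of dividing the raw value by the factor before applying arcsinh. The function name is hypothetical.

```python
import math

COFACTOR = 5.0  # scaling factor stated in the text


def arcsinh_transform(values, cofactor=COFACTOR):
    """Apply asinh(value / cofactor) to each raw abundance value.

    Assumption: the scaling factor divides the raw value. Near zero the
    transform is approximately linear; for large values it is
    approximately logarithmic.
    """
    return [math.asinh(v / cofactor) for v in values]


transformed = arcsinh_transform([0.0, 0.5, 5000.0])
```

The near-linear behavior around zero is what keeps measurements close to the detection limit from being stretched the way a plain logarithm would stretch them.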

Gating of cell subsets 

Cells were then filtered through a series of gates,
as depicted in fig. S2, for the purpose of isolating
particular subpopulations of T cells such as CD4+
naive T cells, CD8+ naive T cells, and CD4+
effector/memory T cells. These gates include
(gate 1) a gate that separates T cells (TCRb+
CD19−) from B cells (TCRb− CD19+), (gate 2) a gate
that further separates CD4+ T cells (CD4+) from
CD8+ T cells (CD8+), (gate 3) a gate that further
separates CD4+ T cells into conventional T cell
(Tconv cell) (CD25−) and regulatory T cell (Treg
cell) (CD25+) populations, and (gate 4) a gate that
further separates CD4+ Tconv cells and CD8+ T cells
into naive (CD44−) and antigen-exposed effector/memory
populations (CD44+). Because there is
no known CD8+ equivalent of the CD4+CD25+
Treg population, CD8+ cells are not passed through
gate 3. The final populations that we isolate and
use in our comparisons include (i) naive CD4+
T cells (CD44− CD25− CD4+ CD8− TCRb+ CD19−),
(ii) naive CD8+ T cells (CD44− CD25− CD8+ CD4−
TCRb+ CD19−), and (iii) effector/memory CD4+
T cells (CD44+ CD25− CD4+ CD8− TCRb+ CD19−).
The nodes, cell types, stimulation, and time points
used in each figure are shown in table S2.
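
The three final populations can be expressed as marker signatures. The sketch below is illustrative only: it assumes each cell has already been scored as positive or negative for every surface marker (the actual gate thresholds are shown in fig. S2 and are not reproduced here), and the names `POPULATIONS` and `classify` are hypothetical.

```python
# Marker signatures of the three final populations as listed in the text.
# True means marker-positive, False means marker-negative.
POPULATIONS = {
    "naive CD4+": {
        "CD44": False, "CD25": False, "CD4": True,
        "CD8": False, "TCRb": True, "CD19": False,
    },
    "naive CD8+": {
        "CD44": False, "CD25": False, "CD8": True,
        "CD4": False, "TCRb": True, "CD19": False,
    },
    "effector/memory CD4+": {
        "CD44": True, "CD25": False, "CD4": True,
        "CD8": False, "TCRb": True, "CD19": False,
    },
}


def classify(cell):
    """Return the population whose signature the cell matches, else None.

    `cell` maps marker name -> bool (positive or negative after gating).
    """
    for name, signature in POPULATIONS.items():
        if all(cell.get(marker) == state for marker, state in signature.items()):
            return name
    return None


example_cell = {
    "CD44": True, "CD25": False, "CD4": True,
    "CD8": False, "TCRb": True, "CD19": False,
}
example_population = classify(example_cell)
```

Cells that match none of the signatures (for example B cells or Treg cells) return `None` and would simply be left out of the comparisons.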

Data availability 

All data are available for download at
www.c2b2.columbia.edu/danapeerlab/html/dremi.html.

Description of computational methods 

In the following sections, we provide a detailed 
explanation of how DREVI, DREMI, and the edge- 
response functions are computed. 

Kernel density and conditional 
density estimation 

A kernel density estimate $\hat f(x)$ is a nonparametric
approximation of the probability density
function $f(x)$. The standard method for computation
of a kernel density estimate is to introduce
a narrow Gaussian (or alternative) kernel at each
point in the raw data and calculate the integral of
the individual kernel values over a large set of
points (21, 44). If $(x_1, x_2, x_3, \ldots, x_n)$ are independent,
identically distributed random samples
from an unknown probability density function $f$,
then the kernel density estimator is

$$\hat f_h(x) = \frac{1}{n}\sum_{i=1}^{n} K_h(x - x_i) \tag{1}$$

where $K$ is the kernel function. The commonly
used Gaussian kernel is given by

$$K(x) = \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2} \tag{2}$$

$K_h$ refers to the kernel function scaled by
$1/h$ (where $h$ denotes the bandwidth), that is,
$K_h(x) = \frac{1}{h}K(x/h)$. Intuitively,
one can think of kernel density estimation as
placing a small smooth kernel $K_h$ at each sampled
point and integrating over all such kernels
to determine a smooth probability density function.
The kernel density estimate in two dimensions
is given by

$$\hat f_{h_1,h_2}(x, y) = \frac{1}{n}\sum_{i=1}^{n} K_{h_1}(x - x_i)\,K_{h_2}(y - y_i) \tag{3}$$

This standard method of kernel density estimation
suffers from several deficiencies, including
(i) sensitivity to the selected bandwidth and the
assumption of normality in many bandwidth estimation
methods, (ii) boundary bias, and (iii) over- or
under-smoothing.

Instead, we use the heat-diffusion method
from (10) for two-dimensional (2D) kernel density
estimation. This method views the kernel density
estimate as a solution to the heat equation,
which it evolves for a time period proportional to
the bandwidth. This interpretation of a kernel
density estimate derives from the notion of a
“Wiener” process. A Wiener process W is a stochastic
Markov process (the next state can be
directly computed from the previous state) with

(i) Initial probability uniformly spread through
the d-dimensional data points $(x_1, x_2, x_3, \ldots, x_n)$.

(ii) The transition probability of going from
point $x_i$ to $x_j$ given by the Gaussian kernel

$$P_{\mathrm{trans}}(x_i \to x_j) = \frac{1}{\sqrt{2\pi t}}\, e^{-(x_i - x_j)^2/(2t)}$$

The Gaussian kernel density estimation (KDE)
can be interpreted as the probability distribution
function (PDF) for this process at time t,
which is the same as Eq. 2 with bandwidth $h = \sqrt{t}$.

But because this is an iterative process, the
transition satisfies the following differential
equation, which is also known as the “heat
equation”:

$$\frac{\partial}{\partial t}\hat f(x; t) = \frac{1}{2}\,\frac{\partial^2}{\partial x^2}\hat f(x; t), \quad t > 0$$

The initial condition is given by
$\hat f(x; 0) = \frac{1}{n}\sum_{i=1}^{n}\delta(x - x_i)$,
where $\delta$ is the Dirac delta function
(which gives point-masses to all data points). The
advantage of writing this in PDF form is that if
the PDF is known to be 0 outside the range, then
boundary conditions enforcing this can be added,
which results in a $\theta$ kernel rather than a Gaussian
kernel:

$$\hat f(x; t) = \frac{1}{n}\sum_{i=1}^{n}\theta(x, x_i; t) = \frac{1}{n}\sum_{i=1}^{n}\sum_{k=-\infty}^{\infty} e^{-k^2\pi^2 t/2}\cos(k\pi x)\cos(k\pi x_i) \tag{4}$$

The $\theta$ kernel is locally adaptive and has the property
that it behaves like a Gaussian kernel for
small bandwidths and like a uniform kernel for
large bandwidths. The evolution of the heat
equation decides the shape of the kernel rather
than having a fixed Gaussian estimate. This often
results in a smoother density function (fig. S3).
Additionally, (i) it can be solved quickly by using
fast Fourier transform methods, (ii) it can be extended
to a more general diffusion equation easily,
and last (iii) the bandwidth t can be solved as a
fixed point solution to a recursion without a
normal assumption on the distribution. The superior
performance and speed of computation of
the heat-equation-based KDE is shown in fig. S3.

To obtain the standard conditional density
estimate, the joint density estimate $\hat f(x, y)$ is
divided by the marginal density estimate $\hat f(x)$
as follows (45):

$$\hat f(y \mid x) = \frac{\hat f(x, y)}{\hat f(x)} \tag{5}$$

We use the conditional density estimate as a
starting point for DREVI and DREMI as explained
in the following sections.
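
To make the relationship between the marginal estimate (Eq. 1), the product-kernel joint estimate (Eq. 3), and the conditional estimate (Eq. 5) concrete, here is a minimal pure-Python sketch. It deliberately uses fixed-bandwidth Gaussian kernels rather than the heat-diffusion estimator the text adopts, so it illustrates the definitions only; all function names and the bandwidth values are assumptions.

```python
import math


def gauss_kernel(u, h):
    """Scaled Gaussian kernel K_h(u) = (1/h) K(u/h), with K as in Eq. 2."""
    return math.exp(-0.5 * (u / h) ** 2) / (h * math.sqrt(2.0 * math.pi))


def kde_1d(x, xs, h):
    """Eq. 1: average of kernels centred on the samples xs."""
    return sum(gauss_kernel(x - xi, h) for xi in xs) / len(xs)


def kde_2d(x, y, xs, ys, h1, h2):
    """Eq. 3: product-kernel estimate of the joint density at (x, y)."""
    total = sum(
        gauss_kernel(x - xi, h1) * gauss_kernel(y - yi, h2)
        for xi, yi in zip(xs, ys)
    )
    return total / len(xs)


def conditional_density(y, x, xs, ys, h1, h2):
    """Eq. 5: joint estimate divided by the marginal estimate at x."""
    return kde_2d(x, y, xs, ys, h1, h2) / kde_1d(x, xs, h1)


# Toy paired samples (illustrative values only).
sample_x = [0.0, 1.0, 2.0]
sample_y = [0.0, 1.0, 4.0]
density_at_point = conditional_density(1.0, 1.0, sample_x, sample_y, 0.5, 0.5)
```

With a product kernel, integrating the joint estimate over y returns the marginal estimate exactly, so the conditional estimate integrates to 1 over y for any fixed x.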


Conditional density-rescaled visualization 

DREVI is aimed at visualizing the stochastic
function representing how X influences Y by depicting
the distribution of Y for each value of X.
To visualize 2D relationships in single-cell data,
we begin by computing a 2D kernel density estimate
as described in Eq. 4 (10) on a square mesh
grid. The mesh grid of points is denoted
$G = \{(x_i, y_j),\ 1 \le i \le n,\ 1 \le j \le m\}$, and the 2D kernel
density estimate is a matrix $D = \hat f(G)$, where
$D(i, j)$ corresponds to $\hat f(x_i, y_j)$. The mesh grid of
points G is selected so that the X-Y ranges are
restricted to regions that are well populated by
cells. Most markers (such as phosphoproteins)
are distributed among the measured cells as
heavy-tailed multimodal distributions (fig. S7C).
For any pair of markers X and Y, we remove 1%
of both tails of these distributions as outliers, removing
~200 cells from a population of ~10,000
cells (per condition and time point). The remaining
range is generally well populated by cells and
is amenable to a robust density estimate, which
results in robust visualization and scoring, as
shown in fig. S7.

After the 2D kernel density estimate is computed,
each column of D (corresponding to a fixed
X-value, $X = x_i$) is renormalized by the marginal
density estimate $\hat f(x_i)$ for that column, as shown
in Eq. 5, to obtain the conditional density
matrix

$$A(x_i, y_j) = \hat f(y_j \mid x_i) \tag{6}$$

This is an estimate of P(Y | X) on the same mesh
grid. This matrix is rescaled for the purposes of
visualization as follows:

$$\hat f^{*}(y_j \mid x_i) = \frac{\hat f(y_j \mid x_i)}{\max_k\,[\hat f(y_k \mid x_i)]} \tag{7}$$

We denote the matrix of rescaled kernel density
estimates as

$$A^{*}(x_i, y_j) = \hat f^{*}(y_j \mid x_i) \tag{8}$$

The matrix of Eq. 8 is visualized as a heatmap
using the “jet” color scale, where the densest regions,
corresponding to the highest rescaled conditional
density, are maroon/red and the sparsest regions
are blue. This way, the X-Y relationship is revealed
even in regions populated by fewer cells because
each X-slice is renormalized individually. The
computation of DREVI is demonstrated in Fig. 1.
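
The column-wise renormalization and rescaling described for DREVI can be sketched as follows. The input is assumed to be a joint density already evaluated on a regular mesh grid and stored as a list of columns (one per X value); the function name and the storage layout are hypothetical, and no plotting is included.

```python
def drevi_matrix(density, dy):
    """Rescale a gridded joint density column by column.

    density[i][j] approximates the joint density at (x_i, y_j) and dy is
    the grid spacing along Y. Each X-slice is divided by its marginal
    (giving the conditional density) and then by its own maximum (giving
    the rescaled conditional density shown in a DREVI heatmap).
    """
    rescaled = []
    for column in density:                  # one X-slice at a time
        marginal = sum(column) * dy         # approximate marginal at x_i
        if marginal == 0:
            rescaled.append([0.0] * len(column))  # empty slice: nothing to show
            continue
        conditional = [v / marginal for v in column]
        peak = max(conditional)
        rescaled.append([v / peak for v in conditional])
    return rescaled


# Two slices with very different cell densities but the same shape in Y.
toy_density = [[1.0, 2.0, 1.0], [10.0, 40.0, 10.0], [0.0, 0.0, 0.0]]
toy_drevi = drevi_matrix(toy_density, dy=1.0)
```

After rescaling, every populated slice peaks at 1 regardless of how many cells fall into it, which is what lets sparsely populated X ranges remain visible.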

Conditional Density-Resampled Mutual 
Information (DREMI) 

We developed a method for scoring relationships
termed DREMI. Like MI, this will depend on
how well the value of X can predict the value of Y.
However, unlike MI, we are not interested in
how the entropy of Y decreases with X in a given
sample on average. Instead, we are interested
in looking at the underlying stochastic function
F(X) = Y, with the aim of learning the strength
of this relationship over the entire dynamic range
of X, regardless of the distribution in the specific
sample.

The key mathematical insight in DREMI is
that we provide a score for the conditional distribution
P(Y | X) rather than the joint distribution
P(X, Y). DREMI resamples from the conditional
density estimate as described above, taking care
to generate a uniform number of samples from
each column of the matrix $A^{*}$, so that the relationship
between X and Y is scored through the full
dynamic range of X that is well sampled by the data.

DREMI begins with the rescaled conditional
density matrix $A^{*}$ described in Eq. 8. To filter out
noise, we round values of $A^{*}(x_i, y_j) < \epsilon$ to zero
and eliminate the corresponding points from the
grid G. We generate n new points
$S_i = \{s_i^1, s_i^2, s_i^3, \ldots, s_i^n\}$ from the density distribution in each
column i of $A^{*}$, denoted $a_i^{*}$. We generate a total of
$n \times m$ samples $S = \{S_1, S_2, \ldots, S_m\}$. Because points
in each column $a_i^{*}$ have a similar value $X = x_i$,
we are generating an equal number of samples
from the entire range of X (up to the fine-mesh
grid level of granularity).

Last, we define a discrete partition of the X
and Y axes and compute mutual information
I(X;Y) as in Eq. 10 using the samples in S. We
generally use eight equi-partitions of the X and Y
axes to compute the mutual information on these
samples, but our software can use any number of
equi-partitions based on a user-defined parameter.
Our partitions are formed by pooling all
data from conditions being compared for a particular
variable, taking the maximal range of
X and Y and partitioning that space into eight
equal units along each axis. We normalize DREMI
by the log base 2 of the number of partitions to
provide a score that lies between 0 and 1.
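
The resampling and scoring steps above can be sketched as follows. This is an illustration under stated assumptions, not the authors' software: the input is a rescaled conditional density matrix stored as a list of columns, the threshold `eps`, the per-column sample count, and the seed are arbitrary choices, and the partitions are taken over the range of the resampled points rather than over data pooled across conditions.

```python
import math
import random
from collections import Counter


def bin_index(value, lo, hi, n_bins):
    """Map a value in [lo, hi] to one of n_bins equal-width partitions."""
    if hi == lo:
        return 0
    return min(int((value - lo) / (hi - lo) * n_bins), n_bins - 1)


def normalized_mi(samples, n_bins=8):
    """Discrete mutual information in bits, divided by log2(n_bins)."""
    if not samples:
        return 0.0
    xs = [x for x, _ in samples]
    ys = [y for _, y in samples]
    x_lo, x_hi, y_lo, y_hi = min(xs), max(xs), min(ys), max(ys)
    binned = [
        (bin_index(x, x_lo, x_hi, n_bins), bin_index(y, y_lo, y_hi, n_bins))
        for x, y in samples
    ]
    n = len(binned)
    joint = Counter(binned)
    px = Counter(bx for bx, _ in binned)
    py = Counter(by for _, by in binned)
    mi = 0.0
    for (bx, by), count in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) p(y)) ), written with raw counts
        mi += (count / n) * math.log2(count * n / (px[bx] * py[by]))
    return mi / math.log2(n_bins)


def dremi_score(a_star, x_grid, y_grid, n_per_column=200, eps=1e-3,
                n_bins=8, seed=0):
    """Resample equally from every X-slice, then score with normalized MI."""
    rng = random.Random(seed)
    samples = []
    for x, column in zip(x_grid, a_star):
        weights = [v if v >= eps else 0.0 for v in column]  # noise filter
        if sum(weights) == 0:
            continue  # slice removed entirely by the filter
        for y in rng.choices(y_grid, weights=weights, k=n_per_column):
            samples.append((x, y))
    return normalized_mi(samples, n_bins)


grid = [float(i) for i in range(8)]
# Y fully determined by X: each slice has all its density in one grid cell.
deterministic = [[1.0 if j == i else 0.0 for j in range(8)] for i in range(8)]
# Y unrelated to X: every slice is flat.
flat = [[1.0] * 8 for _ in range(8)]
score_deterministic = dremi_score(deterministic, grid, grid)
score_flat = dremi_score(flat, grid, grid)
```

In this toy setting the deterministic relationship scores 1 and the flat one scores close to 0 (slightly above zero because of finite sampling), matching the stated range of the normalized score.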

To derive the mathematical expression for
DREMI, we start with standard mutual information,
computed as the differential entropy
between Y and Y | X:

$$I(Y;X) = H(Y) - H(Y \mid X) = -\sum_{j} p(y_j)\log[p(y_j)] + \sum_{i}\sum_{j} p(x_i, y_j)\log[p(y_j \mid x_i)] \tag{9}$$

Alternatively, MI can be regarded as a measure
of the distance between the joint distribution
P(X, Y) and the marginal distributions P(X) and
P(Y). An equivalent formula for computing mutual
information is given by (26)

$$I(X;Y) = \sum_{i}\sum_{j} p(x_i, y_j)\log\left[\frac{p(x_i, y_j)}{p(x_i)\,p(y_j)}\right] \tag{10}$$


The first three steps of the DREMI algorithm involve
estimating the conditional density by normalizing
the joint kernel density estimate and
filtering out noise. These steps result in a smoothed
and de-noised conditional density estimate that
will fill gaps in the raw data. The final step involves
estimating MI from this computed conditional
density. In fig. S20, we show intuitively
how sampling from the conditional density estimate
is equivalent to taking weighted samples
from the joint distribution. Each point $(x_i, y_j)$
from the joint distribution has sample weight
proportional to $1/p(x_i)$.

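Given this, DREMI can be written in terms of
the differential entropy formulation of mutual
information (Eq. 9) as follows:

$$I^{w}(Y \mid X) = H^{w}(Y) - H^{w}(Y \mid X) \tag{11}$$

where $H^{w}(Y)$ and $H^{w}(Y \mid X)$ are reweighted entropies
so that each point $(x_i, y_j)$ in the joint distribution
is weighted by $1/p(x_i)$, given as follows:

$$H^{w}(Y) = -\sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\log[p(y_j)] \tag{12}$$

$$H^{w}(Y \mid X) = -\sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\log[p(y_j \mid x_i)] \tag{13}$$

Substituting Eq. 13 and Eq. 12 into Eq. 11 gives

$$\begin{aligned}
I^{w}(Y \mid X) &= -\sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\log[p(y_j)] + \sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\log[p(y_j \mid x_i)] \\
&= \sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\Big(\log[p(y_j \mid x_i)] - \log[p(y_j)]\Big) \\
&= \sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\log\left[\frac{p(y_j \mid x_i)}{p(y_j)}\right] \\
&= \sum_{i}\sum_{j}\frac{p(x_i, y_j)}{p(x_i)}\log\left[\frac{p(x_i, y_j)}{p(x_i)\,p(y_j)}\right]
\end{aligned} \tag{14}$$

The final form in Eq. 14 is equivalent to Eq. 10,
with each point reweighted by $1/p(x_i)$. Hence, $I^{w}$
maintains many properties of mutual information,
including the fact that $I^{w}(Y \mid X) = 0$ if $p(x, y) =
p(x)\,p(y)$.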
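
The independence property stated for Eq. 14 can be checked numerically. The sketch below evaluates the reweighted sum directly from a small joint probability table; the table layout (`joint[i][j]` for row x_i and column y_j), the use of log base 2, and the function name are assumptions made for illustration.

```python
import math


def reweighted_mi(joint):
    """Evaluate the final form of Eq. 14 on a discrete joint table.

    Each term of the standard mutual information sum is weighted by
    1 / p(x_i). Zero-probability cells contribute nothing.
    """
    px = [sum(row) for row in joint]
    py = [sum(row[j] for row in joint) for j in range(len(joint[0]))]
    total = 0.0
    for i, row in enumerate(joint):
        for j, pij in enumerate(row):
            if pij > 0:
                total += (pij / px[i]) * math.log2(pij / (px[i] * py[j]))
    return total


# Independent case: joint table is the outer product of its marginals.
independent = [[0.9 * 0.5, 0.9 * 0.5], [0.1 * 0.5, 0.1 * 0.5]]
# Dependent case: Y is fully determined by X, with a skewed X marginal.
dependent = [[0.9, 0.0], [0.0, 0.1]]

value_independent = reweighted_mi(independent)
value_dependent = reweighted_mi(dependent)
```

In the dependent example the rarely sampled X value contributes as much weight as the common one, which is the effect the reweighting is meant to achieve.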


Robustness analysis

We demonstrate that DREVI and DREMI are
robust methods. As shown in fig. S7, the downsampling
of the data by 20% results in virtually
identical DREVI plots and very similar DREMI
scores (SD of 0.02 or less) for several edges. The
reason underlying this robustness is our use of
density estimation with a locally adaptive smoothing
that results from the evolution of a differential
equation whose solution is equivalent to the kernel
density estimate (10). The density estimation
method interpolates and fills gaps within the
data by integrating using a smooth kernel.

Fitting edge-response functions 

For strong relationships, P(Y | X) is highly peaked
for any value of X. Moreover, the peaks of P(Y | X)
for adjacent X-slices tend to follow a smooth
and continuous curve, the “edge-response function,”
which we fit to the data. We derive the
edge-response function by fitting points sampled
from the noise-filtered conditional density
$A^{*}$ to one of three models using regularized
regression: linear, sigmoidal, or double sigmoidal
(27). If none of the models obtains a significant
fit (based on P values), we fit a free-form
curve (examples are provided in Figs. 1C, 3,
and 5C).

We define the edge-response function as the 
function/X) that best fits the data. We derive the 
edge-response function by performing nonlinear, 
mixed-model regression (46) of the high-valued 
regions of the conditional-density estimate. The 
models we use are 

(i) linear: fi(x) = ax + b 

(ii) sigmoidal :Xa;) = [(Z - U)/l + (aP/Kl)] + U 
(ill) double-sigmoidal:7(a;) = {[(Zj - Ul/\ + (xP! 

-Kdi)] + Ui} + {[(7.2 - Ul)!! (xPlIQn)-] U^} 

(iv) free-form eurve 

We minimize over both the models and pa- 
rameters in (i) to (iii). If no model results In 
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sufficiently high significance of fit, we fit to a 
nonparametric curve. 
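The idea of minimizing over both models and parameters can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it replaces the regularized regression and P-value-based significance test described above with a coarse grid search scored by RMSE, it assumes $x \geq 0$, and all function and parameter names are hypothetical.

```python
import itertools
import math


def linear(x, a, b):
    return a * x + b


def sigmoidal(x, lower, upper, p, k_d):
    # (L - U) / (1 + x^p / K_d) + U; assumes x >= 0
    return (lower - upper) / (1.0 + x ** p / k_d) + upper


def double_sigmoidal(x, l1, u1, p1, k1, l2, u2, p2, k2):
    # Sum of two sigmoidal terms with independent parameters
    return sigmoidal(x, l1, u1, p1, k1) + sigmoidal(x, l2, u2, p2, k2)


def rmse(model, params, xs, ys):
    squared = sum((model(x, *params) - y) ** 2 for x, y in zip(xs, ys))
    return math.sqrt(squared / len(xs))


def fit_by_grid(model, grid, xs, ys):
    """Return (rmse, params) for the best parameter tuple on the grid."""
    scored = ((rmse(model, params, xs, ys), params)
              for params in itertools.product(*grid))
    return min(scored, key=lambda pair: pair[0])


def select_model(candidates, xs, ys):
    """candidates maps a model name to (function, parameter grid).

    Returns (name, rmse, params) of the overall best fit.
    """
    best = None
    for name, (model, grid) in candidates.items():
        err, params = fit_by_grid(model, grid, xs, ys)
        if best is None or err < best[1]:
            best = (name, err, params)
    return best


# Usage: noiseless sigmoidal data should select the sigmoidal model.
xs = [0.5 * i for i in range(1, 11)]
ys = [sigmoidal(x, 0.0, 1.0, 4, 16.0) for x in xs]
candidates = {
    "linear": (linear, [[0.0, 0.1, 0.2], [0.0, 0.5]]),
    "sigmoidal": (sigmoidal, [[0.0], [1.0], [2, 4], [8.0, 16.0]]),
}
best_name, best_err, best_params = select_model(candidates, xs, ys)
```

In practice a continuous optimizer would replace the grid, and a fallback to a free-form curve would be triggered when no parametric fit is good enough.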

Method availability 

The methods described, along with an interactive graphical user interface (GUI), are available for download at www.c2h2.columbia.edu/danapeerlab/html/dremi.html.

Performance of DREMI on synthetic data 

We used synthetic data to evaluate how DREMI behaves under increasingly noisy conditions (fig. S6). We generated the synthetic data based on characteristics of real pairwise relations observed in single-cell cytometry data:

(i) X is concentrated in a narrow band within the entire X-range.

(ii) The high-density region of the conditional distribution reveals a consensus of response that is smooth and can be fit to a curve.

(iii) The noise in Y is variable.

To fulfill these conditions, we generated data as follows:

For condition (i), X is a random variable that follows a normal distribution with mean 2 and SD 0.25: N(2, 0.25).

For condition (ii), the mean of Y is functionally related to X: $E(Y \mid X = x) = f(x)$. We use linear, sigmoidal, and quadratic functional relationships in the synthetic data.

For condition (iii), Y has varying degrees of Gaussian noise added on top of its mean value: $Y = f(X) + N(0, s)$, for parameter s.

Each synthetic data set contained ~20,000 points. We generated synthetic data following the above conditions for five different values of noise: s = {0.25, 0.5, 0.75, 1, 1.25}. As shown in fig. S6, the results of computing DREMI on this synthetic data show that DREMI scores decrease roughly linearly with the SD of the Gaussian noise, regardless of the underlying functional shape of the relationship.
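The generation recipe above can be sketched as follows. The text specifies only the distribution of X, the additive Gaussian noise, the sample size, and the five noise levels; the concrete coefficients of the linear, sigmoidal, and quadratic functions below, the function names, and the fixed seed are assumptions chosen for illustration.

```python
import math
import random

NOISE_LEVELS = [0.25, 0.5, 0.75, 1.0, 1.25]   # values of s used in the text


def linear(x):
    return x                                   # assumed slope and intercept


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-4.0 * (x - 2.0)))   # assumed steepness and midpoint


def quadratic(x):
    return (x - 2.0) ** 2                      # assumed vertex at the mean of X


def make_synthetic(f, noise_sd, n=20000, seed=0):
    """Draw X ~ N(2, 0.25) and Y = f(X) + N(0, noise_sd)."""
    rng = random.Random(seed)
    xs = [rng.gauss(2.0, 0.25) for _ in range(n)]
    ys = [f(x) + rng.gauss(0.0, noise_sd) for x in xs]
    return xs, ys


# Usage: one data set per noise level for a given functional shape.
xs, ys = make_synthetic(linear, NOISE_LEVELS[1])
```

Repeating the call for each function and each entry of `NOISE_LEVELS` gives the 15 data sets on which a dependency score can be compared across noise levels.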

In addition to the analysis described in the main text, we also evaluate how varying the noise-filtering threshold influences the results. Also shown in fig. S6 is how the noise-filtering threshold e can be used to eliminate experimental noise. Depicted in fig. S6B is a horizontal line that equalizes the scores of all of the noise levels. Thus, the value of e can be tuned to compensate for experimental or measurement noise.

There are certain settings that provide naturally noisier signals than others. For instance, certain antibodies can have weak binding affinities with their target proteins, whereas others can nonspecifically bind to a variety of proteins. In such cases, a higher level of the outlier-elimination threshold may be useful. In other situations, such as the pSLP76-pERK edge, the response of the protein itself is stochastic, and therefore, the stochasticity itself has meaning. In such cases, the outlier-elimination threshold can be set higher to capture the natural stochasticity in the data. Other global contexts, such as measurements on primary cells as opposed to cell lines, lead to higher noise and weaker response. In these cases, it may be useful to set the noise-filtering threshold to a particular global value. We use a globally fixed noise-filtering threshold of 0.85 for all figures and DREMI values computed for this manuscript. As shown in fig. S6, within any set noise-filtering threshold, the DREMI score decreases linearly with the SD of noise, so DREMI scores can be ranked relative to one another for comparison purposes.

Comparing DREMI to other methods 

We compared the performance of DREMI against other commonly used dependency measures: (i) MIC (32), (ii) adaptively partitioned MI (47), and (iii) Pearson correlation (Fig. 5). Because no gold standard exists to compare strength of signal transfer along an edge, we created a framework to conduct the comparisons based on the dynamics of signaling. We used DREVI to visually determine the peak timing of the dynamic signaling response, and in cases in which peak timing is known, our visual inspection matched published literature (2). We then assessed whether each of the four metrics can recover the peak timing and also order the time points correctly according to strength of signal transfer. We systematically computed all of the metrics along the time series in each of the six edges (Fig. 5I and fig. S10). DREMI outperformed all other metrics.

To show that DREMI is able to quantify and rank relationships in settings other than TCR-stimulated mouse cells, we also compared DREMI on three known signaling edges on previously published B cell data (8) from the bone marrow of healthy human donors after BCR (B cell receptor) stimulation (fig. S11). These data only have a single time point after BCR stimulation; therefore, instead of ranking edge strength along time points, we rank edge strength across increasingly mature B cell populations. The five populations, in order of maturity, are (i) Pre-BI, (ii) Pre-BII, (iii) Immature B cells, (iv) Mature CD38$^+$ B cells, and (v) Mature CD38$^-$ B cells. The cells are gated into these populations based on phenotypic B cell markers such as CD20, CD38, CD45RA, CD19, and CD123, as described in (8). We used prior biological knowledge to determine the cell population expected to have strongest signaling, determined before computing the metrics. DREMI outperformed all other metrics on this data.

An important goal of MIC is “equitability.” Intuitively, this means that different types of edge shapes are given similar scores if they are similarly “fuzzy” (33). Judging from MIC's performance, this does not seem to be a suitable assumption for molecular relations. Adaptive partitioning puts more weight into the densest part of the distribution. Although this might make sense, in essence, it makes a large number of microscopic partitions over the small range where most of the density lies. Thus, the partitions often represent random measurement noise rather than biological signal. Pearson correlation prefers linear dependencies to other types of dependencies, whereas nonlinear dependencies are frequently present in signaling relationships.


In Fig. 5, II, we compare DREMI on synthetic data, where the generating function is known. Here, we generate a weak linear relationship with low slope compared with the range of the data, $Y = aX + N(0, 0.1)$, to compare against strong nonlinear relationships:

(i) A quadratic relationship with a high degree of noise, $Y = -AX^2 + N(0, 0.75)$.

(ii) A sigmoidal relationship X-Y with both Gaussian noise and uniform noise in the range [0, Y], for any value of Y: $Y = \mathrm{sigmoid}(X) + N(0, 0.75) + U[0, \mathrm{sigmoid}(X)]$, where U is a uniform random function.

In both cases, the noisy-yet-strong relationships scored lower than the weak linear relationship with low noise for all metrics other than DREMI.

Evaluating edge-response functions on raw versus resampled conditional data

In fig. S8, we show that the data resampled from the conditional density is more suitable for curve-fitting than the raw data. In the first two columns, we use linear, sigmoidal, or double sigmoidal least-squares regression on the raw mass cytometry data. In the third column, we resample from the dense regions of the conditional density and perform the same types of regression on this resampled data. Last, in the fourth column we perform regression on the conditional mean (computed from the conditional density).

In each case, the best fit on the raw data has higher RMSE than the best fit on data resampled from the conditional density (measured on the raw data). In addition, the linear fit always results in a better fit on the raw data, whereas the conditional data result in a variety of shapes: linear, sigmoidal, and double sigmoidal. The raw data tend to be concentrated in a small region of the X-Y plane, resulting in a weak penalty for deviations in most regions of the X-Y plane, and a heavy penalty for deviation only in the region of data concentration. Therefore, a line that passes through the concentrated region of the raw data generally results in the best fit.
BLACK HOLE PHYSICS

Black hole lightning due to particle acceleration at subhorizon scales

Supermassive black holes with masses of millions to billions of solar masses are commonly found in the centers of galaxies. Astronomers seek to image jet formation using radio interferometry but still suffer from insufficient angular resolution. An alternative method to resolve small structures is to measure the time variability of their emission. Here we report on gamma-ray observations of the radio galaxy IC 310 obtained with the MAGIC (Major Atmospheric Gamma-ray Imaging Cherenkov) telescopes, revealing variability with doubling time scales faster than 4.8 min. Causality constrains the size of the emission region to be smaller than 20% of the gravitational radius of its central black hole. We suggest that the emission is associated with pulsar-like particle acceleration by the electric field across a magnetospheric gap at the base of the radio jet.


More than three decades ago, it was proposed that the radio emission of extragalactic jets results from a relativistically moving plasma consisting of magnetic fields and accelerated particles following a power-law energy distribution (1). One of the major assets of the model is that it can explain the nonthermal emission of extragalactic jets across the entire electromagnetic spectrum, from radio waves up to gamma rays. The emission can be understood as synchrotron radiation and inverse Compton scattering (2, 3) due to particles accelerated at shock waves in the jets. The gamma rays can reach very high energies measured in giga-electron volts (1 GeV = $10^{9}$ eV, corresponding roughly to the rest mass energy equivalent of the proton) and tera-electron volts (1 TeV = $10^{12}$ eV). According to the Blandford-Znajek mechanism, the jets are powered by extracting rotational energy from the black holes, which have acquired angular momentum through the accretion of surrounding gas and black hole mergers (4), although so far astrophysical evidence for the role of black hole spin is still lacking (5). For a maximally rotating supermassive black hole of mass $M = 10^{8}\, m_8\, M_\odot$, where $M_\odot$ denotes one solar mass, the size of the jet formation region should be of the order of its gravitational radius, $r_g = G_N M/c^2 \approx 1.5 \times 10^{11}\, m_8$ m ($G_N$, gravitational constant; $c$, speed of light) and twice this value for a nonrotating Schwarzschild black hole. Astronomical telescopes do not yet provide the angular resolution needed to image structures on this scale. The highest-resolution images of jets obtained with very long baseline radio interferometry show radio-emitting knots traveling down the jets (6). Approaching the black hole, the spectra cut off at increasingly higher frequencies due to synchrotron self-absorption. Observations at very high frequencies where the core becomes transparent are needed to zoom into the region from which the jets emerge. The record holder is a very long baseline radio interferometry observation of the jet of the nearby radio galaxy M87 at a frequency of 230 GHz, resolving a source with a size of 11.0 ± 0.8 gravitational radii (7).

The event horizon light-crossing time 

Although direct imaging of the jet formation region has to await better angular resolution, indirect information about its size can be inferred from the temporal variability of the emission coming from that region. The observed gamma-ray variability time scales indeed reach down to the event horizon light-crossing time, $\Delta t_{\rm BH} = r_g/c = G_N M/c^3 = 8.3\, m_8$ minutes (8), vindicating the scenario that the jets originate from the magnetospheres of accreting black holes. An example is the radio galaxy M87 in the Virgo cluster of galaxies (9, 10). This galaxy harbors a central supermassive black hole with the enormous mass of $\sim 6.4 \times 10^{9}\, M_\odot$ (11). M87 exhibits gamma-ray variability on a time scale of days (12), which is consistent with the light-crossing time of the event horizon $\Delta t_{\rm BH}$(M87) = 0.4 days.
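The light-crossing time quoted above follows directly from the gravitational radius. The short Python sketch below evaluates $r_g = GM/c^2$ and $r_g/c$ for a given mass in solar units; the rounded physical constants and the function names are choices made for this illustration.

```python
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8          # speed of light, m s^-1
M_SUN = 1.989e30     # solar mass, kg


def gravitational_radius_m(mass_solar):
    """Gravitational radius r_g = G M / c^2 in meters."""
    return G * mass_solar * M_SUN / C ** 2


def light_crossing_time_s(mass_solar):
    """Event horizon light-crossing time r_g / c in seconds."""
    return gravitational_radius_m(mass_solar) / C


# Usage: a 1e8 solar-mass black hole and the M87 mass quoted in the text.
t_1e8_min = light_crossing_time_s(1e8) / 60.0
t_m87_days = light_crossing_time_s(6.4e9) / 86400.0
```

With these constants, a $10^{8}$ solar-mass black hole gives roughly 8 minutes and the M87 mass gives a few tenths of a day, in line with the values stated in the text.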


Smaller than a black hole? 

Observations of Mrk 501 (13) and PKS 2155-304 (14) at very high energies have provided evidence for extreme variability events, with flux doubling time scales as short as ~2 min. The ultrafast variability corresponds to $\Delta t < \Delta t_{\rm BH}$ and therefore casts a shadow of doubt on the current shock-in-jet paradigm. It has been suggested that relativistic bulk motion of the jets could explain the observed time scales (15). The argument relies on the observation that these flaring sources belong to the class of sources that astronomers call blazars. In blazars, the jets are pointing at a small angle toward the observer. Because the jet plasma moves with a speed close to the speed of light, $\beta = v_j/c \approx 1$, leading to a bulk Lorentz factor $\Gamma_j \gg 1$, several effects arise due to the relativistic boosting of the emission (16). One of them affects the time scale of flux variations of the emission from a shock. The moving shock plasma almost catches up with its own radiation, and this leads to a shortening of the observed variability time scale $\Delta t$ as compared with the variability time scale $\Delta t'$ in a frame co-moving with the shock, given by $\Delta t = (1 + z)\,\delta^{-1}\Delta t'$, where $z$ denotes the cosmological redshift of the source. For a given value of $\Gamma_j$, the Doppler factor $\delta$ depends strongly on the orientation angle of the jet $\theta$ (Fig. 1). Note that $\theta = 0$ corresponds to perfect alignment. For Mrk 501 and PKS 2155-304, almost perfectly aligned jets with $\Gamma_j > 50$ would be needed to accommodate $\Delta t < \Delta t_{\rm BH}$ and to avoid self-absorption of the gamma rays because of pair production (15). In blazars, interferometric observations of the superluminal motion of radio knots suggest lower values of $\Gamma_j \sim 10$ and orientation angles of a few degrees (17). Larger values of $\Gamma_j$ would lead to a problem with population statistics: The number of unbeamed counterparts of blazars viewed at larger angles would then exceed the number of radio galaxies, which are commonly believed to represent the misaligned blazars (18). Assuming lower black hole masses would bring down $\Delta t_{\rm BH}$. However, lower masses conflict with the firmly established dynamical measurements of black hole masses and therefore do not seem to be a likely solution of the dilemma. Other possible solutions of this Doppler factor crisis (19) invoke models of structured jets (20, 21) or Poynting flux-dominated jets, in which only a few but very fast seed particles at the jet base reach high Lorentz factors (22-24) before the Poynting flux is converted into the kinetic energy of the bulk flow by mass entrainment.
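The dependence of the Doppler factor on orientation angle can be illustrated with the standard special-relativistic expression $\delta = [\Gamma_j(1 - \beta\cos\theta)]^{-1}$, which is assumed here because the text does not write it out. The function names below are hypothetical, and the second function simply applies the time-scale relation quoted in the text.

```python
import math


def doppler_factor(lorentz_factor, angle_deg):
    """Doppler factor 1 / [Gamma (1 - beta cos(theta))] for a jet at angle theta."""
    beta = math.sqrt(1.0 - 1.0 / lorentz_factor ** 2)
    cos_theta = math.cos(math.radians(angle_deg))
    return 1.0 / (lorentz_factor * (1.0 - beta * cos_theta))


def observed_timescale(dt_comoving, lorentz_factor, angle_deg, redshift=0.0):
    """Observed variability time scale (1 + z) * dt' / delta."""
    return (1.0 + redshift) * dt_comoving / doppler_factor(lorentz_factor, angle_deg)


# Usage: a jet with bulk Lorentz factor 10 seen on-axis, at 1/Gamma, and at 20 degrees.
delta_on_axis = doppler_factor(10.0, 0.0)
delta_at_cone = doppler_factor(10.0, math.degrees(0.1))
delta_at_20deg = doppler_factor(10.0, 20.0)
```

For perfect alignment the Doppler factor approaches $2\Gamma_j$, at $\theta \approx 1/\Gamma_j$ it is close to $\Gamma_j$, and at angles of 10° to 20° it drops to values of a few, which is why strongly misaligned jets cannot shorten the observed time scale very much.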

All of these attempts to explain the subhorizon scale variability with relativistic projection effects alone ignore a fundamental problem. If the perturbations giving rise to the blazar variability are injected at the jet base, the time scale of the flux variations in the frame co-moving with the jet is affected by time dilation with Lorentz factor $\Gamma_j$. In blazars where $\delta \sim \Gamma_j$, the Lorentz factor cancels out and the observed variability time scale is ultimately bounded below by $\Delta t_{\rm BH}$.
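The cancellation can be written out in one line. The sketch below assumes, as the argument states, that a perturbation injected at the jet base on the time scale $\Delta t_{\rm BH}$ is stretched by time dilation to at least $\Gamma_j \Delta t_{\rm BH}$ in the co-moving frame, and then applies the Doppler relation $\Delta t = (1+z)\,\delta^{-1}\Delta t'$:

```latex
\Delta t \;=\; \frac{1+z}{\delta}\,\Delta t'
\;\geq\; (1+z)\,\frac{\Gamma_j}{\delta}\,\Delta t_{\rm BH}
\;\approx\; (1+z)\,\Delta t_{\rm BH}
\qquad \text{for } \delta \sim \Gamma_j .
```

The gain from Doppler boosting is therefore offset by the time dilation of the injected perturbation, independent of the value of $\Gamma_j$.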

IC 310: A gamma-ray lightning inferno

IC 310 is a peculiar radio galaxy located in the outskirts of the Perseus cluster at a distance of 260 million light years from Earth. On 12/13 November 2012, MAGIC (Major Atmospheric Gamma-ray Imaging Cherenkov), a system of two Imaging Atmospheric Cherenkov telescopes located on the Canary island of La Palma (25), detected an extraordinary outburst of gamma rays from this object (Fig. 2 and fig. S1). The details of the analysis can be found in section S2. Before these observations, variable gamma-ray emission from IC 310 had already been detected by satellite and ground-based gamma-ray instruments at GeV and TeV energies; e.g., Fermi-LAT and MAGIC (26-28). On the night of the flare in November 2012, the mean flux above 300 GeV was $(6.08 \pm 0.29) \times 10^{-11}$ cm$^{-2}$ s$^{-1}$; that is, four times higher than the highest flux during previous observations in 2009/2010. The measured spectrum (Fig. 3) can be described by a simple power law with a differential photon spectral index of $\Gamma = 1.90 \pm 0.04_{\rm stat} \pm 0.15_{\rm syst}$ in the energy range of 70 GeV to 8.3 TeV (table S2). Owing to its proximity, the spectrum of IC 310 is only marginally affected by photon-photon absorption in collisions with the extragalactic background light (EBL).

Fig. 1. The relation between Doppler factor and orientation angle evaluated for various values of the bulk Lorentz factor compared with the observational constraints of these parameters. The blue box shows the constraints on the Doppler factor arising from radio observations (32). For illustrative purposes, the red box shows the constraint from the gamma-ray optical depth to pair creation, assuming $L_{\rm syn} = 1\%\, L_{\rm VHE}$.

Fig. 2. Significance map (color scale) of the Perseus cluster in gamma rays observed on the night of 12/13 November 2012 with the MAGIC telescopes. The inset shows the radio jet image of IC 310 at 5.0 GHz obtained with the European Very Long Baseline Interferometry Network (EVN) on 29 October 2012. Contour lines (and the color scale associated with them) increase logarithmically by factors of 2, starting at three times the noise level (see the supplementary materials for image parameters). The ratio of the angular resolution between MAGIC and the EVN is 1:580,000.

IC 310 harbors a supermassive black hole with a mass of $M = (3^{+4}_{-2}) \times 10^{8}\, M_\odot$ (section S1.1), corresponding to an event horizon light-crossing time of $\Delta t_{\rm BH} = (23^{+34}_{-15})$ min. The mass has been inferred from the correlation of black hole masses with the central velocity dispersion of their surrounding galaxies (29, 30). The reported errors are dominated by the intrinsic scatter of the distribution. The same value of the mass is obtained from the fundamental plane of black hole activity (31). The scatter in the fundamental plane for a single measurement is larger and corresponds to a factor of ~7.5.

During 3.7 hours of observations, extreme variability with multiple individual flares was detected (Fig. 4 and figs. S3 and S4). The flare showed the most rapid flux variations ever observed in extragalactic objects, comparable only to those seen in Mrk 501 and PKS 2155-304. A conservative estimate of the shortest variability time scale in the frame of IC 310 yields $\Delta t/(1 + z) = 4.8$ min. It is the largest doubling time scale with which the rapidly rising part of the flare can be fitted with a probability > 5% (fig. S4). The light curve also shows pronounced large-amplitude flickering characterized by doubling time scales down to $\Delta t \sim 1$ min. The conservative variability time scale corresponds to 20% of the light travel time across the event horizon, or 60% of it, allowing for the scatter in the dynamical black hole mass measurement.

From the absence of a counter radio jet and the requirement that the proper jet length does not exceed the maximum of the distribution of jet lengths in radio galaxies, the orientation angle was found to be in the range $\theta \sim 10^\circ$ to $20^\circ$ (section S1.2), and the Doppler factor consistent with $\delta \sim 4$ (32). These values put IC 310 at the borderline between radio galaxies and blazars. The jet power estimated from observations of the large-scale radio jet is $L_j = 2 \times 10^{42}$ erg s$^{-1}$, assuming that it contains only electrons, positrons, and magnetic fields in equipartition of their energy densities (section S1.3). For a radiative efficiency of 10%, the Doppler-boosted average luminosity of the jet emission amounts to $0.1\,\delta^{4} L_j \approx 5 \times 10^{43}$ erg s$^{-1}$, which is close to the one observed in very high-energy gamma rays. For $\delta \sim 4$, the variability time scale in the co-moving frame of the jet, where it should be larger than $\Gamma_j \Delta t_{\rm BH}$, is actually close to $\Delta t_{\rm BH}$ (Fig. 1). A very high value of the Doppler factor is required to avoid the absorption of the gamma rays due to interactions with low-energy synchrotron photons, inevitably co-produced with the gamma rays in the shock-in-jet scenario. The optical depth to pair creation by the gamma rays can be approximated by $\tau_{\gamma\gamma}$(10 TeV) $\sim 300\,(\delta/4)^{-6}\,(\Delta t/1\ {\rm min})^{-1}\,(L_{\rm syn}/10^{42}\ {\rm erg\ s^{-1}})$. Adopting a nonthermal infrared luminosity of ~1% of the gamma-ray luminosity during the flare, the emission region would be transparent to the emission of 10-TeV gamma rays only if $\delta > 10$.


For the range of orientation angles inferred from radio observations, the Doppler factor is constrained to a value of $\delta < 6$ (Fig. 1). One can speculate whether the inner jet, corresponding to the unresolved radio core, bends into a just-right orientation angle to produce the needed high value of the Doppler factor (section S1.2). The probability of such an alignment seems to be rather low. Moreover, the observed radio jet does not show any signs of a perturbation of its flow direction on the parsec and kiloparsec scales. Because perturbations of the flow direction of the inner jet would later propagate to larger scales, major bends apparently never occurred in the past.

Fig. 3. Average spectral energy distributions during the flare (red) along with previous measurements of IC 310 as observed by MAGIC. We show the results from the high (blue, open squares) and low (black, open markers) states reported in (28) and the average results (gray triangles) from (27) for the whole period. The dashed lines show power-law fits to the measured spectra, and the solid line with solid circles depicts the spectrum corrected for absorption in the extragalactic background light. As a reference, the spectral power-law fit of the Crab Nebula observations from (25) is shown (gray, solid line). Vertical error bars show 1 SD statistical uncertainty. Because of the unfolding procedure, spectral points are correlated. Horizontal error bars show the energy binning.

Fig. 4. Light curve of IC 310 observed with the MAGIC telescopes on the night of 12/13 November 2012, above 300 GeV. As a flux reference, the two gray lines indicate levels of 1 and 5 times the flux level of the Crab Nebula, respectively. The precursor flare (MJD 56243.972-56243.994) has been fitted with a Gaussian distribution. Vertical error bars show 1 SD statistical uncertainty. Horizontal error bars show the bin widths.

In summary, trying to interpret the data in the frame of the shock-in-jet model meets difficulties. Considering the role of time dilation renders a solution of this problem impossible for any value of $\Gamma_j$. Therefore, we conclude that the observations indicate a subhorizon-scale emission region of a different nature.
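The pair-opacity argument used above can be checked numerically. The sketch below assumes the scaling $\tau_{\gamma\gamma}(10\ {\rm TeV}) \approx 300\,(\delta/4)^{-6}(\Delta t/1\ {\rm min})^{-1}(L_{\rm syn}/10^{42}\ {\rm erg\ s^{-1}})$; the exponents and the $10^{42}$ erg s$^{-1}$ normalization are assumptions of this sketch, chosen to be consistent with the statement that transparency requires a Doppler factor of about 10, and the function names are hypothetical.

```python
def pair_optical_depth(doppler, dt_min, l_syn_erg_s):
    """Assumed scaling of the 10-TeV pair-creation optical depth."""
    return 300.0 * (doppler / 4.0) ** -6 * dt_min ** -1 * (l_syn_erg_s / 1e42)


def min_doppler_for_transparency(dt_min, l_syn_erg_s):
    """Doppler factor at which the assumed optical depth drops to 1."""
    return 4.0 * (300.0 * (l_syn_erg_s / 1e42) / dt_min) ** (1.0 / 6.0)


# Usage: minute-scale variability with a synchrotron luminosity of 1e42 erg/s.
tau_at_delta_4 = pair_optical_depth(4.0, 1.0, 1e42)
delta_min = min_doppler_for_transparency(1.0, 1e42)
```

Under these assumptions the emission region is strongly opaque at $\delta \sim 4$ and becomes transparent only slightly above $\delta = 10$, well beyond the range allowed by the radio constraints on the orientation angle.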

Possible origins of subhorizon-scale variability

Substructures smaller than the event horizon scale emitting highly anisotropic radiation (to avoid pair absorption) seem to be responsible for the minute-scale flux variations. The possible explanations fall into three categories: (i) mini-jet structures within the jets (33); (ii) jet-cloud interactions, where the clouds may originate from stellar winds (34-36); and (iii) magnetospheric models (37-41), similar to those known from pulsar theory.



Fig. 5. Scenario for the magnetospheric origin of the gamma rays: A maximally rotating black hole with event horizon $r_g$ (black sphere) accretes plasma from the center of galaxy IC 310. In the apple-shaped ergosphere (blue) extending to $2r_g$ in the equatorial plane, Poynting flux is generated by the frame-dragging effect. The rotation of the black hole induces a charge-separated magnetosphere (red) with polar vacuum gap regions (yellow). In the gaps, the electric field of the magnetosphere has a component parallel to the magnetic field that accelerates particles to ultrarelativistic energies. Inverse Compton scattering and copious pair production due to interactions with low-energy thermal photons from the plasma accreted by the black hole lead to the observed gamma rays.


According to the mini-jet model (i), plasmoids 
resulting from magnetic reconnection traveling 
down the jet with a relativistic speed are respon- 
sible for the minute-scale flux variations observed 
in blazars. The model could help to mitigate the 
constraints on the bulk Lorentz factor by intro- 
ducing a larger effective bulk Lorentz factor for 
the plasmoids. The mechanism also predicts reconnection events from regions outside of the beaming cone $\sim \Gamma_j^{-1}$ that could explain the day-scale flares from the radio galaxy M87 invoking
external radiation fields as a target for inverse 
Compton scattering (33). However, the off-axis 
mini-jet luminosity depends on $(\Gamma_j \theta)^{-8}$, and the
jet power required for IC 310 is two orders of 
magnitude higher than the one estimated from 
radio observations (section S3.2). Thus, this mod- 
el is challenged by the observed high luminosity 
in IC 310 during the flares. 

Substructures smaller than the jet radius may 
also be introduced by considering interactions 
between clouds and the jet (ii). The original 
shock-in-jet model (1) considered this to be the 
main source of mass entrainment and predicted 
variability from the process. Recently, more elab- 
orate work on the model has had some success in 
explaining the variability of M87 by proton-proton 
collisions due to the bombardment of clouds 
boiled off of red giants with protons in the jet 
(36). However, the model is linked to the cloud 
crossing time of the jet and the proton-proton 
cooling time, both of which far exceed the event 
horizon scale. Faster variability could be observed 
if the cloud gets destroyed, but a strong beaming 
effect would then be needed to explain the ob- 
served luminosities. In another variant, drift ac- 
celeration of particles along the trailing shock 
behind the stellar wind of a star interacting with 
the jet is considered. This might lead to an ex- 
tremely anisotropic emission pattern. As mass- 
loosing stars sweep across the jet, passing magnetic 
field lines pointing to the observer, the postu- 
lated accelerated particle beams in their trails 
become visible for a short time. For IC 310, the 
emission would have to be confined to within an
angle of a~10“® rad to explain the observed var- 
iability time scale, requiring a very stable direc- 
tion of the accelerated particle beams, at a large 
angle to the jet main thrust. Because two-fluid 
particle beams are prone to numerous plasma 
instabilities, the scenario relies on unphysical 
assumptions. 

In magnetospheric models (iii), particle accel- 
eration is assumed to be due to electric fields 
parallel to the magnetic fields. This mechanism 
is known to operate in the particle-starved mag- 
netospheres of pulsars, but it could also operate 
in the magnetospheres anchored to the ergo- 
spheres of accreting black holes (Fig. 5). Electric 
fields can exist in vacuum gaps when the density 
of charge carriers is too low to warrant their short- 
cut. The critical charge density for the vacuum 
gaps is the so-called Goldreich-Julian charge den- 
sity. Electron-positron pairs in excess of the 
Goldreich-Julian charge density can be produced 
thermally by photon-photon collisions in a hot 
accretion torus or corona surrounding the black 


hole. It has also been suggested that particles can 
be injected by the reconnection of twisted mag- 
netic loops in the accretion flow (39). A depletion 
of charges from thermal pair production is ex- 
pected to happen when the accretion rate becomes 
very low. In this late phase of their accretion 
history, supermassive black holes are expected to 
have spun up to maximal rotation. Black holes 
can sustain a Poynting flux jet by virtue of the
Blandford-Znajek mechanism (4). Jet collima- 
tion takes place rather far away from the black 
hole at the scale of the light cylinder beyond 
$\sim 10\,r_g$. Gaps could be located at various angles with the jet axis, corresponding to the polar and outer gaps in pulsar magnetospheres, leading to fan beams at rather large angles with the jet axis.
The gap emission must be highly variable, be- 
cause gap height and seed particle content de- 
pend sensitively on plasma turbulence and accretion 
rate. For an accretion rate of m ~ 10"“'’ (in units of 
the Eddington accretion rate) and maximal black 
hole rotation, the gap height in IC 310 is expected 
to be $h \sim 0.2\,r_g$ (40), which is in line with the
observations. Depending on the electron temper- 
ature and geometry of the radiatively inefficient 
accretion flow, its thermal cyclotron luminosity 
can be low enough to warrant the absence of pair 
creation attenuation in the spectrum of gamma
rays. In this picture, the intermittent variability 
witnessed in IC 310 is due to a runaway effect. As 
particles accelerate to ultrahigh energies, electromagnetic cascades develop, multiplying the number of charge carriers until their current shortcuts
the gap. The excess particles are then swept away 
with the jet flow, until the gap reappears. 

Radio galaxies and blazars with very low accretion rates allow us to obtain a glimpse of the
jet formation process near supermassive black 
holes. The subhorizon variability in combination 
with the results from direct imaging campaigns 
invite us to explore analogies with pulsars, where 
particle acceleration takes place in two stages. In 
the first stage, particle acceleration occurs in the 
gaps of a charge-separated magnetosphere an- 
chored in the ergosphere of a rotating black hole, 
and in a second stage, particle acceleration occurs at shock waves in the force-free wind beyond the outer light cylinder.
Biosynthesis, regulation, and domestication of bitterness in cucumber

Yi Shang, Yongshuo Ma, Yuan Zhou, Huimin Zhang, Lixin Duan, Huiming Chen, Jianguo Zeng, Qian Zhou, Shenhao Wang, Wenjia Gu, Min Liu, Jinwei Ren, Xingfang Gu, Shengping Zhang, Ye Wang, Ken Yasukawa, Harro J. Bouwmeester, Xiaoquan Qi, Zhonghua Zhang,
Cucurbitacins are triterpenoids that confer a bitter taste in cucurbits such as cucumber, melon, watermelon, squash, and pumpkin. These compounds discourage most pests on the plant and have also been shown to have antitumor properties. With genomics and biochemistry, we identified nine cucumber genes in the pathway for biosynthesis of cucurbitacin C and elucidated four catalytic steps. We discovered transcription factors Bl (Bitter leaf) and Bt (Bitter fruit) that regulate this pathway in leaves and fruits, respectively. Traces in genomic signatures indicated that selection imposed on Bt during domestication led to derivation of nonbitter cucurbits from their bitter ancestors.


Plant specialized metabolites play essential roles in mediating interactions between the plant and its environment and constitute a valuable resource in discovery of economically important molecules. In the plant family Cucurbitaceae, a group of highly oxygenated tetracyclic and bitter triterpenes, the cucurbitacins, mediated the coevolution between cucurbits and herbivores. They serve either as protectants against generalists or feeding attractants to specialists (1-3). Widely consumed as vegetables and fruits, cucurbits were domesticated from their wild ancestors that had extremely bitter fruits. Drought and temperature stress can increase the bitterness in certain domesticated cultivars, which can affect fruit quality and marketability. The molecular basis of the occurrence and domestication of bitterness in cucurbits remains largely unknown.

Despite their presence in fruits as a negative agricultural
taste, cucurbitacins have for centuries been exploited for
anti-inflammatory and hepatoprotective activities, in the form
of traditional herbal medicines (4, 5). Bitter fruits and leaves
of wild cucurbit plants have been used as a purgative and emetic
in India (6). The bitter fruit stem of melon (in Chinese,
“gua di”) is prescribed as a traditional hepatoprotective
medicine whose effect and usage were well documented in Ben Cao
Gang Mu, the Chinese Encyclopedia of Botany and Medicines
composed by the Ming Dynasty physician Li Shi-Zhen in 1590 CE.
Recent studies revealed that cucurbitacins can cause cell-cycle
arrest, apoptosis, and growth suppression of cancer cells through
the specific inhibition of the Janus kinase-signal transducers
and activators of transcription (JAK-STAT) pathway (7, 8). At
present, their low concentrations in plants and nonspecific
cytotoxicity limit their pharmaceutical applications.

To date, plant metabolic diversification studies (9, 10), as
well as recently reported gene clusters in plants [reviewed in
(11)], indicate that clustering of functionally related genes for
the biosynthesis of secondary metabolites may well be a common
feature of plant genomes. In cucumber, two interacting Mendelian
loci were reported to control the bitterness, conferred
predominantly by cucurbitacin C (CuC) (3, 12). The Bi gene (1)
confers bitterness to the entire plant and is genetically
associated with an operon-like gene cluster (13), similar to the
gene cluster involved in thalianol biosynthesis in Arabidopsis
(14). Fruit bitterness requires both Bi and the dominant Bt
(Bitter fruit) gene. Nonbitterness of cultivated cucumber fruit
is conferred by bt, an allele selected during domestication as
indicated by population genomics (15). Exploiting these genetic
clues, here we report the discovery of 11 genes involved in the
biosynthesis, regulation, and domestication of cucumber
bitterness.

First committed step in CuC biosynthesis 

To identify genetic variants associated with Bi, a genome-wide
association study was performed based on the variation map (15)
of 115 diverse cucumber lines (Fig. 1A and table S1). The most
significant single-nucleotide polymorphism (SNP) was located
within the region where Bi had been mapped and resulted in a
nonsynonymous change from cysteine (C) to tyrosine (Y) at residue
393 (C393Y) of the cucumber gene Csa6G088690 (Fig. 1B). In the
115 lines, this SNP explained the phenotype in all but one line,
CG7744. In-depth analysis of the variation map identified a
1-base pair (bp) deletion at Csa6G088690 in CG7744 that resulted
in a frameshift mutation at the 760th amino acid residue (FS760)
(Fig. 1B). Genetic analysis established that Csa6G088690 defines
the Mendelian Bi gene (fig. S1A).

Fig. 1. The Bi gene. (A) Genome-wide association study for the bitter foliage trait. Red arrow, most
significant association. Scale, -log10 of P value of SNPs. (B) Amino acid alignment between wild Csa6G088690
and two mutant alleles. (C) GC-MS analysis of extracts prepared from yeast INVSc1 that harbored Bi, two
mutant alleles (C393Y and FS760), empty vector, or an authentic standard. TIC, total ion chromatograms;
EIC 498, extracted ion chromatograms at a mass/charge ratio (m/z) of 498 [M+TMS (trimethylsilyl)].
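As a rough illustration of how a single SNP can be scored against a binary trait, the sketch below runs a two-sided Fisher's exact test on a 2x2 table of allele by phenotype and reports -log10(P), the quantity plotted on the scale of Fig. 1A. It is not the association model used in the study, which is not described in this text. Only the total of 115 lines and the single discordant line come from the text; the split into 100 bitter and 15 nonbitter lines is a made-up assumption, and `fisher_exact_two_sided` is a hypothetical helper name.

```python
from math import comb, log10

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2
    total = comb(n, row1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(col1, x) * comb(n - col1, row1 - x) / total

    p_obs = prob(a)
    lo = max(0, row1 - (n - col1))
    hi = min(row1, col1)
    # Sum every table that is at least as unlikely as the observed one.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(p, 1.0)

# Hypothetical counts: rows = bitter / nonbitter foliage,
# columns = C393 allele / Y393 allele. The one nonbitter line carrying
# C393 stands in for CG7744, which is explained by FS760 instead.
table = (100, 0, 1, 14)
p_value = fisher_exact_two_sided(*table)
neg_log10_p = -log10(p_value)
print(f"P = {p_value:.3g}, -log10(P) = {neg_log10_p:.1f}")
```

A real genome-wide scan would repeat a test of this kind for every SNP and would usually correct for population structure, which this sketch ignores.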

Bi is a member of the oxidosqualene cyclase (OSC) gene family.
Phylogenetic analysis showed that Bi is the ortholog of the
cucurbitadienol synthase gene CPQ in squash (Cucurbita pepo) (16)
(fig. S1B). We next used yeast to express Bi, as well as its two
mutant alleles, C393Y and FS760, to test its biochemical
function. As revealed by gas chromatography-mass spectrometry
(GC-MS) analysis, formation of cucurbitadienol occurred only in
the yeast strain expressing the wild-type gene (Fig. 1C and
fig. S1, C and D). Thus, in cucumber, Bi encodes a
cucurbitadienol synthase that catalyzes the cyclization of
2,3-oxidosqualene into the tetracyclic cucurbitane skeleton, the
first committed step of CuC biosynthesis (fig. S1E).

A leaf-specific regulator of Bi 

To investigate the molecular mechanism regulating CuC
biosynthesis, we searched for naturally occurring mutants and
screened an ethyl methanesulfonate-induced cucumber mutant
library, and subsequently identified two nonbitter mutants (XY-3
and E3-231). The foliage expression level of Bi in the natural
mutant XY-3 was reduced to less than 1% of that in the bitter
isogenic line XY-2 (Fig. 2A), which indicated that cucurbitacin
biosynthesis is disrupted in XY-3. Genomes of XY-2 and XY-3 were
resequenced and compared to identify possible mutations. A SNP in
the cucumber gene Csa5G156220 caught our attention, as it encodes
a putative basic helix-loop-helix (bHLH) transcription factor
(TF) expressed specifically in leaves (table S2). The mutation
resides at the splice site of the predicted intron and likely
disrupts proper gene transcription (Fig. 2, A and B).

Fig. 2. The Bl gene. (A) Expression of Bl
(Csa5G156220) and Bi in nonbitter mutant XY-3
and bitter XY-2 cucumber lines (means ± SEM, n =
3). (B) Sequence alignment between wild Bl and
two mutated alleles. (C and D) Transient expression
of Bl in cotyledons complemented the nonbitter
phenotype of XY-3. (C) Expression of Bl and
Bi determined 7 days after agroinfiltration (means ±
SEM, n = 6). Value obtained from control (CK) was
set to 1 and used to obtain relative values for the
test sample. INF, sample infiltrated with Bl; CK,
sample infiltrated with empty vector. (D) Presence
or absence of CuC detected by high-performance
liquid chromatography (HPLC) analysis of extracts
prepared from Bl or control infiltrated cotyledons.
mAU, milli-arbitrary units. (E) Schematic of the Bi
promoter region (2000 bp upstream of the start
codon). Black vertical lines indicate locations of
E-box motifs, and red horizontal lines indicate
regions amplified in ChIP assays or used in EMSA.
Localization of mutated E-box used in EMSA is
indicated in red. (F) ChIP analysis of Bl recruitment
to the Bi promoter region by PCR. ChIP assays
conducted with or without (+/-) Myc antibody.
INF, sample infiltrated with Bl; CK, sample
infiltrated with empty vector. (G) qPCR analysis of Bl
recruitment to the indicated Bi promoter region
(means ± SEM, n = 3). (H) EMSA showing that
Bl-His specifically binds, in vitro, to the E-box region
within the Bi promoter. Lane identified by a red
triangle indicates that the E-box element within the
probe has been mutated from CANNTG to GANNTG.
Comp, competitor (unlabeled probe); His, His-tag;
+/-, presence or absence of protein or competitor;
closed triangle, increasing amount of protein or
competitor.

Resequencing of E3-231 revealed another SNP located within
Csa5G156220 that caused a change from arginine (R) to lysine (K)
at the 85th amino acid residue (R85K), which is located inside
the basic domain (Fig. 2B). This mutation may affect regulatory
ability, as the basic domain is essential for the DNA binding
ability of bHLH TFs (17). Genetic analyses showed that the
mutations in XY-3 and E3-231 are two recessive alleles of the
same gene (fig. S2A). Increased expression of both Bi and
Csa5G156220 was also observed in cucumber plants either exposed
to drought stress or treated with the phytohormone ABA (fig. S2,
B and C), which indicated that abiotic stress may stimulate
bitterness biosynthesis in cucumber by up-regulation of
Csa5G156220.

A cucumber cotyledon transient agroinfiltration expression
system was developed to further confirm the in vivo function of
Csa5G156220 (18). Increasing expression of Csa5G156220 in XY-3
cotyledons up-regulated expression of Bi, which in turn
functionally complemented the nonbitter phenotype (Fig. 2, C and
D, and fig. S3, A and B). Thus Csa5G156220 regulates bitterness
biosynthesis in cucumber leaves, and hence, this gene was named
Bl (Bitter leaf).

Next, we investigated how Bl regulates Bi. Analysis of the Bi
promoter region revealed the occurrence of seven E-box (CANNTG)
sequences (Fig. 2E), a cis-element to which bHLH TFs could
potentially bind (17). Yeast one-hybrid (Y1H) assay and a tobacco
transient reporter (luciferase) activation system showed that Bl
indeed could bind to this promoter (fig. S2, D and E). Chromatin
immunoprecipitation (ChIP) assays were performed by using
formaldehyde-fixed cotyledons of XY-3 that were transiently
expressing a Bl-Myc fusion protein. As revealed by the polymerase
chain reaction (PCR) products and quantitative real-time PCR
(qPCR), Bl was selectively recruited to the Bi promoter region
containing E-box elements (Fig. 2, F and G). Electrophoretic
mobility-shift assays (EMSAs) also confirmed selective binding of
Bl to the E-box elements within the Bi promoter (Fig. 2H). Thus,
Bl regulates cucurbitacin biosynthesis by activating
transcription of Bi in cucumber leaves.
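The promoter scan described above can be sketched as a simple pattern search. The example below finds every E-box (CANNTG) in a DNA string, including overlapping hits; because the motif pattern is its own reverse complement, scanning one strand is enough. The function name `find_eboxes` and the short test fragment are hypothetical and are not taken from the study; the actual Bi promoter sequence is not reproduced here.

```python
import re

# Lookahead so that overlapping motifs are all reported.
EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")

def find_eboxes(promoter):
    """Return (0-based offset, matched sequence) for each CANNTG motif."""
    seq = promoter.upper()
    return [(m.start(), m.group(1)) for m in EBOX.finditer(seq)]

# Hypothetical 18-nt fragment, for illustration only.
example = "ttCACGTGaaCAGCTGcc"
for offset, motif in find_eboxes(example):
    print(offset, motif)
```

Applied to a 2000-bp upstream region, the length of the returned list would give the motif count, and the offsets could be used to choose ChIP amplicons or EMSA probes.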

A cucumber domestication gene 

Bt was previously mapped to a 442-kilobase (kb) region on
chromosome 5 that harbors 67 predicted genes (15). Bl and its two
homologs (Csa5G157220 and Csa5G157230) are among these candidates
and are clustered in an 8.5-kb region (Fig. 3A). As Bl positively
regulates Bi in cucumber leaves, we considered Csa5G157230 to be
a candidate for Bt, given that it is specifically expressed in
the fruit of the wild line PI 183967, consistent with the
distribution of bitterness in these plants (Fig. 3A and table
S2). In addition, positive correlations were observed between
expression levels of Csa5G157230 and Bi, and between fruit
bitterness and gene expression in various cucumber lines,
especially in the five extremely bitter wild lines (Fig. 3B).
These studies established a correlation between Csa5G157230
expression and accumulation of bitterness in the fruit.

Next, we performed a local association analysis within the
442-kb region to further identify genetic variants associated
with the extremely bitter phenotype. This led to finding 11
signals at the regulatory region of Csa5G157230, including 10
SNPs and one structural variant, a 699-bp insertion 2195 bp
upstream of the Csa5G157230 start codon (SV-2195). Another
variant was also identified at the regulatory region of
Csa5G157230, a SNP 1601 bp upstream of the start codon
(SNP-1601), which cosegregated with the Bt locus in a large F2
population (n = 1822). In the 115 lines, the 22 carrying a
homozygous “A” at SNP-1601 all bear nonbitter fruits (table S3).
These analyses indicated that selection at the regulatory region
of Csa5G157230 may down-regulate Csa5G157230 expression in
cultivated lines, which results in reduced fruit bitterness.

In some cucumber lines, fruits become bitter under stress
conditions. For instance, the fruits of the cucumber line HAN
become bitter when plants are grown at a low temperature (18°C
day, 12°C night), whereas, at a normal temperature (30°C day,
22°C night), the fruits are not bitter. We identified a natural
HAN mutant (han), whose fruits were nonbitter even under such
low temperature conditions. Resequencing both lines revealed a
mutation corresponding to SNP-1601 (G in HAN and A in han).
Genetic analysis showed that SNP-1601 cosegregates with the
phenotype (fig. S4A). Our qPCR analysis indicated that SNP-1601
is essential for regulating Bi expression in response to this
environmental factor (Fig. 3C).

Fig. 3. The Bt gene. (A) The Bt-mapped region on chromosome 5 overlaps with a large domestication
sweep region showing almost zero nucleotide diversity in the domesticated population (top). Differential
expression profiles of genes predicted within the Bt region illustrated by a gradient in red (bottom).
Numeric expression values of predicted genes are shown in table S2. Candidate Bt gene is indicated in
red. CuC content of wild and cultivated cucumber was compared (means ± SEM, n = 3). WF, wild fruit; CF,
cultivated fruit; WL, wild leaf; CL, cultivated leaf. (B) High consistency observed between expression of
Bt, Bi, and the CuC content in 21 cucumber lines, including five extremely bitter lines (means ± SEM, n = 3,
indicated in red). Presence or absence of SV-2195 indicated by +/-. Genotype of SNP-1601 (Y; A or G, U;
unknown). (C) High consistency among cold-stress treatment; expression of Bt, Bi, and CuC content in
fruit of HAN, han, and Fj individual plants (means ± SEM, n = 3). (D and E) Transient expression of Bt in fruit
complemented the nonbitter phenotype of cucumber line XinTaiMiCi-2. (D) Expression of Bt and Bi
determined 15 days after agroinfiltration (means ± SEM, n = 3). Value obtained from control (CK) was set to
1 and used to obtain relative values for the test sample. INF, sample infiltrated with Bt; CK, sample
infiltrated with empty vector. (E) Presence or absence of CuC detected by HPLC analysis of extracts
prepared from Bt or control infiltrated fruits 15 days after agroinfiltration. mAU, milli-arbitrary units.

To confirm the in vivo function of Csa5G157230, a fruit
transient gene expression system was developed (18). Expression
of Csa5G157230 activated transcription of Bi and promoted
biosynthesis of CuC in the fruit (Fig. 3, D and E). In parallel
experiments, we expressed Csa5G157230 in XY-3 cotyledons, with
the method described above. An increase in CuC content in the
infiltrated XY-3 tissue was also observed (fig. S3C), which
indicated that the TFs Bl and Csa5G157230 have a similar
biochemical function and that they control CuC biosynthesis in
different organs. Next, we tested whether Csa5G157230 could
directly regulate the Bi gene. Here, we expressed the Myc-tagged
protein in cotyledons of XY-3 to prepare sufficient material for
ChIP assays. Similar to Bl, Csa5G157230 could bind to the E-box
elements within the Bi promoter (fig. S4, B to F). Taken
together, these studies provide strong support for the
hypothesis that Csa5G157230 is the Bt gene, which activates Bi
and regulates CuC biosynthesis in the fruit.

Fig. 4. Nine pathway genes that are coordinately regulated.
(A and B) Identification of coexpressed candidate
enzymes by analyzing transcriptomic
data acquired from cultivar 9930 (A)
and wild line PI 183967 (B). Candidate
enzymes are indicated with different
colors according to their annotations.
Low-expressed gene Csa6G088180 is
indicated with hatched green and was
used as a negative control in the
following analyses. (C to F) Coregulation
of candidate genes (means ± SEM,
n = 3). Down-regulation of the nine
genes in XY-3 as compared with XY-2
(C) and han as compared with HAN
(D) (asterisk indicates samples
prepared from plants grown under low
temperature), and up-regulation of
the nine genes in the presence of ABA
treatment (E) or drought stress (F).
(G) Summary of interaction between
candidate gene promoter and Bl or Bt.
Luc, luciferase trans-activation assay.
(H) Function of enzymes elucidated by
transient RNAi assays (means ± SEM,
n = 6). RNAi sample in blue; CK in red.
Value obtained from control (CK) was
set to 1 and used to obtain relative
values for the RNAi sample. CK, sample
infiltrated with empty vector. More
information is provided in figs. S5 to S7.

Nine genes in CuC biosynthetic pathway 

To catalyze the formation of CuC, cucurbitadienol has to be
further modified with a series of oxidation reactions and
acetylation, likely catalyzed by cytochrome P450 enzymes (P450s)
and an acyltransferase (ACT). On chromosome 6, Bi colocalizes
with four P450 genes (Csa6G088160, Csa6G088170, Csa6G088180, and
Csa6G088710) and one ACT gene (Csa6G088700) within a 35-kb
genomic region. Except for Csa6G088180, all other genes shared
nearly identical expression patterns, with high expression
occurring in leaves of line 9930 as well as in fruits of wild
line PI 183967 (Fig. 4, A and B). In addition, these coexpressed
genes were down-regulated in leaves of XY-3 as compared with
XY-2 and in fruits of han as compared with HAN, and they were
up-regulated in cucumber leaves under ABA treatment or drought
stress (Fig. 4, C to F, and table S4). Furthermore, our studies
showed that Bl and Bt could specifically bind to the promoters
of these coexpressed genes and could activate their
transcription (Fig. 4G, and figs. S5 and S6). Mutation (R85K)
within the basic domain of Bl appeared to affect its binding
ability to the CuC biosynthetic genes (fig. S5, C and D), which
in turn is likely to result in the nonbitter phenotype of
cucumber (E3-231). Although the Y1H assay showed that Bt could
also interact with the promoter of Csa6G088180 (fig. S5A), Bt
cannot activate transcription of Csa6G088180 (figs. S5B and
S6C).

We failed to find, within the cluster, the specific P450
responsible for oxidizing cucurbitadienol, which suggests that
other candidates are located outside this 35-kb genomic region.
We reasoned that such genes would be coexpressed with the Bi
cluster and coregulated by Bl and Bt. Therefore, by applying the
integrative bioinformatics and molecular biology approach
described above, we identified four additional P450 genes (three
on chromosome 3, Csa3G698490, Csa3G903540, and Csa3G903550, and
one on chromosome 1, Csa1G044890) that are coexpressed with the
Bi cluster and are activated by Bl and Bt in leaves and fruits,
respectively (Fig. 4, A to F, and table S4).

The relation of CuC biosynthesis and these candidate tailoring
enzymes was probed by using a transient RNA interference (RNAi)
system acting on the bitter cotyledon of the cucumber line 9930
(18). RNAi-mediated target-specific down-regulation of
transcripts for all these candidate genes resulted in a decrease
in CuC content in the infiltrated cotyledons (Fig. 4H and fig.
S7). Thus, Bl and Bt regulate bitterness formation in leaves and
fruits, respectively, by direct transactivation of nine genes
(one OSC, seven P450s, and one ACT) involved in the CuC
biosynthetic pathway.
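For bookkeeping, the nine pathway genes named in this section can be tallied by enzyme class. The sketch below only restates gene identifiers given in the text (the Bi cluster on chromosome 6 without the non-coexpressed Csa6G088180, plus the four P450s found outside the cluster); the dictionary layout and the name `PATHWAY_GENES` are illustrative choices, not part of the study.

```python
from collections import Counter

# Gene ID -> enzyme class, as listed in the text.
PATHWAY_GENES = {
    "Csa6G088690": "OSC",   # Bi, cucurbitadienol synthase
    "Csa6G088160": "P450",  # in the Bi cluster
    "Csa6G088170": "P450",  # in the Bi cluster
    "Csa6G088710": "P450",  # in the Bi cluster
    "Csa3G698490": "P450",  # chromosome 3
    "Csa3G903540": "P450",  # chromosome 3
    "Csa3G903550": "P450",  # chromosome 3
    "Csa1G044890": "P450",  # chromosome 1
    "Csa6G088700": "ACT",   # in the Bi cluster
}

counts = Counter(PATHWAY_GENES.values())
print(len(PATHWAY_GENES), dict(counts))
```

The tally reproduces the stated composition of one OSC, seven P450s, and one ACT.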

Three more steps in CuC biosynthesis 

To characterize the biochemical function of these candidate
P450s, we expressed each P450 in the engineered yeast (EY10)
that accumulates 10 times as much cucurbitadienol as its
original strain (18) (fig. S8). No expected product was detected
from yeast extract at first (Fig. 5A). However, once an
NADPH-cytochrome P450 oxidoreductase gene (CPR, Csa1G423150) was
expressed with candidate P450 in the EY10, we detected a
specific product catalyzed by Csa3G903540 (a member of the CYP88
family, located outside the Bi cluster) (Fig. 5A). The structure
of this purified product (compound 1) was interrogated by
nuclear magnetic resonance (NMR) spectroscopy (figs. S9 and
S10), which indicated that it was a derivative of
cucurbitadienol in which the 19-CH3 was hydroxylated. The
product of Csa3G903540 was named 19-hydroxy cucurbitadienol.


We continued to search for downstream P450s using this same
approach. As revealed by liquid chromatography-mass spectrometry
(LC-MS) assays, we identified an expected peak in the yeast
expressing Bi, CPR, Csa3G903540, and Csa6G088160 (a member of
the CYP81 family, located within the Bi cluster) (Fig. 5B).
Tandem mass spectrometry (MS/MS) and NMR analysis revealed that
a hydroxyl group was transferred to the C-25 position of
19-hydroxy cucurbitadienol and that the double bond between
C-24,25 was shifted to the position of C-23,24 (figs. S11 and
S12). The product (compound 2) of Csa6G088160 was named
19,25-dihydroxy cucurbitadienol.

From fresh bitter cucumber leaves, our NMR analysis identified
a deacetyl CuC (figs. S13 and S14, compound 3). LC-MS analysis
showed that the ACT enzyme (Csa6G088700) was able to acetylate
this compound to yield CuC (Fig. 5C). These studies indicate
that Csa6G088700 is the enzyme involved in the final step in the
CuC biosynthetic pathway.

Fig. 5. Three more catalytic steps. (A) GC-MS analysis of putative product (red arrow) generated by
Csa3G903540 in the engineered yeast (EY10). Partial enlarged details are shown as insets. TIC, total ion
chromatograms; EIC 586, extracted ion chromatograms at m/z of 586. The product structure (right) was
elucidated by NMR (18). (B) Ultra-performance liquid chromatography coupled with quadrupole time-of-flight
mass spectrometry (UPLC-qTOF-MS) analysis of yeast extracts with electrospray ionization (ESI)
on positive mode. EIC 459.3833, extracted ion chromatograms of the accurate parent ion at m/z of
459.3833. The product (indicated by red arrow) structure was elucidated by MS/MS and NMR (18). (C)
UPLC-qTOF-MS analysis of the acetyltransferase catalytic reaction product. Deacetyl-CuC is acetylated
by Csa6G088700-His in vitro (indicated by red arrow). A leaf-specific ACT (Csa5G639480) served as a
negative control. Schematic of this biosynthetic pathway from deacetyl-CuC to CuC is shown at right.
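The m/z values quoted for the pathway intermediates can be checked with a little arithmetic. The sketch below assumes the molecular formula C30H50O for cucurbitadienol (an assumption, since the formula is not stated in the text), adds one oxygen per hydroxylation, treats each trimethylsilyl (TMS) derivatization as a net gain of C3H8Si, and models the positive-mode ESI ion as [M+H]+. The atomic masses are standard monoisotopic values rounded to six decimals, and the helper names `mono_mass` and `add_tms` are hypothetical.

```python
# Monoisotopic atomic masses (u), rounded to six decimals.
MASS = {"C": 12.000000, "H": 1.007825, "O": 15.994915, "Si": 27.976927}
PROTON = 1.007276  # mass of H+ (hydrogen atom minus one electron)

def mono_mass(formula):
    """Monoisotopic mass of a formula given as {element: count}."""
    return sum(MASS[el] * n for el, n in formula.items())

def add_tms(formula, groups):
    """Each TMS ether replaces one hydroxyl H with Si(CH3)3: net +C3H8Si."""
    out = dict(formula)
    out["C"] = out.get("C", 0) + 3 * groups
    out["H"] = out.get("H", 0) + 8 * groups
    out["Si"] = out.get("Si", 0) + groups
    return out

cucurbitadienol = {"C": 30, "H": 50, "O": 1}   # assumed formula
hydroxy_19 = {"C": 30, "H": 50, "O": 2}        # compound 1
dihydroxy_19_25 = {"C": 30, "H": 50, "O": 3}   # compound 2

eic_fig1 = mono_mass(add_tms(cucurbitadienol, 1))   # GC-MS, one TMS group
eic_fig5a = mono_mass(add_tms(hydroxy_19, 2))       # GC-MS, two TMS groups
mh_fig5b = mono_mass(dihydroxy_19_25) + PROTON      # ESI positive, [M+H]+

print(round(eic_fig1), round(eic_fig5a), round(mh_fig5b, 4))
```

Under these assumptions the three values come out at nominal masses 498 and 586 and at an accurate mass of 459.3833, which agrees with the extracted-ion settings given for Fig. 1C, Fig. 5A, and Fig. 5B.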


In summary, we discovered that two TFs regulate nine genes in
the CuC biosynthetic pathway and propose a model as to how
extremely bitter wild cucumber was domesticated into nonbitter
cultivars (fig. S15). As revealed in this study, such regulators
must contribute to the highly coordinated and efficient
transcription of plant specialized metabolic pathways. The new
knowledge on cucurbitacin biosynthesis will open a door for
biological manufacturing and engineering of these triterpenoids
as antitumor drugs, for example, in a manner similar to the
biosynthesis of artemisinic acid, the antimalarial drug
precursor (19, 20).

PALEOMAGNETISM

Solar nebula magnetic fields recorded 
in the Semarkona meteorite 

Roger R. Fu, Benjamin P. Weiss, Eduardo A. Lima, Richard J. Harrison,
Xue-Ning Bai, Steven J. Desch, Denton S. Ebel, Clément Suavet,
Huapei Wang, David Glenn, David Le Sage, Takeshi Kasama,
Ronald L. Walsworth, Aaron T. Kuan

Magnetic fields are proposed to have played a critical role in some of the most enigmatic 
processes of planetary formation by mediating the rapid accretion of disk material 
onto the central star and the formation of the first solids. However, there have been no 
experimental constraints on the intensity of these fields. Here we show that dusty 
olivine-bearing chondrules from the Semarkona meteorite were magnetized in a nebular 
field of 54 ± 21 microteslas. This intensity supports chondrule formation by nebular
shocks or planetesimal collisions rather than by electric currents, the x-wind, or other 
mechanisms near the Sun. This implies that background magnetic fields in the terrestrial 
planet-forming region were likely 5 to 54 microteslas, which is sufficient to account for 
measured rates of mass and angular momentum transport in protoplanetary disks. 


Astronomical observations of young stellar objects indicate
that early planetary systems evolve through a protoplanetary
disk phase in <5 million years (My) following the collapse of
their parent molecular clouds (1, 2). Disk evolution on such
short time scales requires highly efficient inward transport of
mass accompanied by outward angular momentum transfer, which
allows disk material to accrete onto the central star while
delivering angular momentum out of the protoplanetary system.

The mechanism of this rapid mass and angular momentum
redistribution remains unknown. Several proposed processes
invoke a central role for nebular magnetic fields. Among these,
the magnetorotational instability (MRI) and magnetic braking
predict magnetic fields with intensities of ~100 μT at 1
astronomical unit (AU) in the active layers of the disk (3, 4).
Alternatively, transport by magnetocentrifugal wind (MCW)
requires large-scale, ordered magnetic fields stronger than
~10 μT at 1 AU. Finally, nonmagnetic effects such as the
baroclinic and Goldreich-Schubert-Fricke instabilities may be
the dominant mechanism of angular momentum transport in the
absence of sufficiently strong magnetic fields (5). Direct
measurement of magnetic fields in the planet-forming regions of
the disk can potentially distinguish among and constrain these
hypothesized mechanisms.

Although current astronomical observations cannot directly
measure magnetic field strengths in planet-forming regions
[(6, 7); supplementary text], paleomagnetic experiments on
meteoritic materials can potentially constrain the strength of
nebular magnetic fields. Chondrules are millimeter-sized lithic
constituents of primitive meteorites that formed in transient
heating events in the solar nebula. If a stable field was
present during cooling, they should have acquired a
thermoremanent magnetization (TRM), which can be characterized
via paleomagnetic experiments. Besides assessing the role of
magnetic fields in disk evolution, such paleomagnetic
measurements would constrain the currently unknown mechanism of
chondrule formation.

Chondrules likely constituted a sizable fraction of the mass of
asteroids and terrestrial planet precursors and may have
facilitated the accretion of the first planetesimals (8, 9). The
formation of chondrules therefore very likely represents a key
stage in the evolution of the early solar system. The ambient
magnetic field strength is a distinguishing characteristic among
chondrule formation models. The x-wind model implies strong
stellar fields of >80 to 400 μT (10). In contrast, magnetic
fields in the nebular shock and planetesimal collision models
are likely considerably lower than 100 μT (11, 12).

Previous paleomagnetic measurements of individual chondrules
have focused mostly on the Allende CV chondrite (13). However,
owing to extensive aqueous alteration on the CV parent body,
magnetic phases in Allende chondrules are secondary and do not
retain preaccretional magnetization [i.e., magnetization
acquired during the last heating of chondrules in the nebula and
before the accretion of the meteorite's parent body; (14)].
Reliable recovery of preaccretional magnetization requires
samples that have avoided appreciable postaccretional
remagnetization processes.

Among the most pristine known meteorites is the Semarkona
LL3.00 ordinary chondrite. We conducted paleomagnetic studies on
Semarkona, focusing in particular on dusty olivine-bearing
chondrules (Fig. 1). Dusty olivine crystals consist of
submicrometer-sized grains of nearly pure body-centered cubic
(bcc) Fe (kamacite) embedded in forsteritic olivine (15). Such
olivine grains are found in approximately 1 in 10 chondrules in
ordinary chondrites.

Fig. 1. Dusty olivine-bearing chondrules from the Semarkona meteorite. (A) Optical
photomicrograph of chondrule DOC4 showing the location of dusty olivine grains. Image taken in reflected
light with crossed polarizers. (B) Annular dark-field scanning transmission electron microscope
(STEM) image of four dusty olivine Fe grains from chondrule DOC5. Brightness in image reflects
column-averaged atomic number: darker grains are smaller in size, implying a higher relative abundance
of olivine at their location and hence a lower mean atomic number. The euhedral morphology and
chemical homogeneity of the Fe grains are apparent, which indicates the lack of secondary recrystallization
and alteration. Such Fe grains are the primary carriers of preaccretional magnetization in Semarkona
chondrules.

Because of their unique compositional and magnetic properties,
dusty olivine grains are expected to retain preaccretional
magnetization. The small grain size of dusty olivine metal
implies that most are in the single domain (SD) or single vortex
(SV) states, which can retain stable magnetization over the
history of the solar system (16-19). Further, the Ni-poor
composition (Ni <2 weight %) of dusty olivine metal precludes
metamorphic recrystallization (15, 20). The domain states of
dusty olivine metals imply very high coercivities ranging up to
>200 mT, as confirmed by our demagnetization experiments (see
below). The magnetization of grains with such high coercivities
should not have been much altered by the low shock pressures
likely experienced by Semarkona [4 to 10 GPa (21, 22)]. Finally,
the low porosities of the surrounding olivine crystals have
protected metal from aqueous alteration (Fig. 1B).

The distinctive, high coercivities of dusty olivine grains
allow for the isolation of their remanent magnetization during
alternating field (AF) demagnetization, as larger (10 to 100 μm)
mesostasis metal grains are expected to demagnetize at AF levels
<50 mT (23). Furthermore, because the postaccretional peak
metamorphic temperature of Semarkona was likely only 200° to
260°C (24, 25), preaccretional remanence in dusty olivine metals
should be isolated upon laboratory (1 hour duration) thermal
demagnetization to <450°C assuming that metamorphism lasted
~5 My (16, 19). In summary, no known postaccretional process is
likely to have compromised preaccretional remanent magnetization
in Semarkona dusty olivines; strong-field AF demagnetization or
thermal demagnetization above ~450°C is expected to isolate the
preaccretional component of magnetization.

We isolated eight dusty olivine-bearing chondrules
from two 15 mm by 10 mm by 150-μm-thick
sections of Semarkona provided by the
American Museum of Natural History (AMNH). 
Both sections contain fusion crust along one 
edge. We also extracted 5 non-dusty olivine- 
bearing chondrules and 29 bulk (i.e., mixed 
matrix and chondrule) samples to characterize 
any postaccretional overprints. All extracted 
samples are mutually oriented to within 5°. Owing 
to their weak moments [natural remanent mag- 
netization (NRM) ranging between 10”“ and 3 x 
10”’^ A-m^ before demagnetization], chondrules 


were measured with the superconducting quan- 
tum interference device (SQUID) microscope 
(26, 27) at the MIT Paleomagnetism Laboratory 
(Fig. 2). Supporting magnetic imaging measure- 
ments with higher spatial resolution were per- 
formed with a nitrogen-vacancy (NV) quantum 
diamond microscope (supplementary text). 

Bulk samples were subjected to AF demag- 
netization up to 85 to 145 mT or thermal de- 
magnetization up to 580°C. We identified three 
unidirectional postaccretional overprints in our 
Semarkona samples: two low coercivity (LCa and 
LCb) and one medium coercivity (MCa) compo- 
nents. Twenty bulk samples carry the LCa overprint 
blocked up to between 4.5 and 13 mT (Fig. 3). This 
component is present in both fusion crust material 
and meteorite interior samples, indicating that it 
was acquired after arrival on Earth. Removal of 
the LCa magnetization during thermal demag- 
netization to only 70°C indicates that it is likely a 
viscous remanent magnetization (VRM) acquired 
during long-term exposure to the Earth’s field.

In contrast, the MCa overprint is only present
in samples within 4.7 mm of the fusion crust. The
mean paleointensity of this component based
on the isothermal remanent magnetization (IRM)
and anhysteretic remanent magnetization (ARM)
normalization methods (76 µT) is within uncertainty
of the Earth’s magnetic field. We conclude

Fig. 2. AF and thermal demagnetization of single dusty olivine-bearing
chondrules measured with SQUID microscopy. Orthogonal projection diagrams
showing the evolution of the natural remanent magnetization (NRM) of
two chondrules upon progressive demagnetization. Open and solid circles
indicate the projection of the NRM vector onto the vertical (up-east) and
horizontal (north-east) planes, respectively. (A) AF demagnetization of DOC1
reveals a low coercivity (LC) overprint removed by 20 mT and higher
coercivity (HC) magnetization that persists to >290 mT while decaying in
magnitude toward the origin. Insets show associated magnetic field maps
measured with the SQUID microscope at the indicated demagnetization
levels where positive (red) field values are in the up direction. The stable
directionality and steady decay of the magnetization can be seen during AF
application above 20 mT. (B) Mixed AF and thermal demagnetization of
DOCS shows the removal of the postaccretional LCb overprint by 20.0 mT
(green points), a stationary moment between room temperature and ~400°C
(gray), and an origin-trending high temperature (HT) component removed by
780°C (red). Steps above 40 mT or 575°C have been averaged to suppress
noise.

that the MCa component was acquired during
heating from atmospheric passage. Finally, a small
subset of seven samples from one edge of our
Semarkona section carry the LCb overprint,
which is completely removed by AF demagnetization
up to between 10.5 and 30 mT. The high
intensity of the LCb component (NRM to saturation
IRM ratio of 0.23) suggests that it was
acquired during exposure to artificial magnetic
fields. Only two dusty olivine-bearing chondrules
carried the LCb overprint, which was fully removed
upon AF application to 20 mT. To summarize,
all samples >1.0 mm from the fusion crust,
which include all dusty olivine-bearing chondrules,
do not carry any postaccretional remagnetization
other than the LCa, MCa, and LCb components.
These overprints, if present, are readily
identified and removed via AF cleaning.

Eight dusty olivine-bearing chondrules were
subjected to AF and thermal demagnetization
up to 290 to 420 mT or 780°C. Six of these were
found to carry a high coercivity (HC) or high
temperature (HT) component of magnetization.
We argue based on seven lines of evidence that
these HC/HT components are preaccretional
TRMs (supplementary text). (i) HC/HT components
decay to the origin upon demagnetization,
which is the expected behavior of primary
magnetization (Fig. 2). (ii) The HC/HT magnetization
directions in the six chondrules are collectively
random, passing the conglomerate test
at the 95% confidence level [Fig. 3; (28)]. No
HC/HT component is oriented in the direction of
any of the postaccretional overprints. (iii) The
HC magnetizations in chondrules DOC3 and
DOC4, which were each partitioned into two
subsamples, are unidirectional within each chondrule
and inconsistent with random magnetizations
at the 99% confidence level. Such uniformity
is expected of a TRM acquired by individual
chondrules cooling in the solar nebula because
the field should be uniform across the submillimeter
scale of each sample (14). (iv) The blocking
temperature range of the HT component (350°
to 750°C) agrees closely with that expected of a
preaccretional magnetization that was partially
demagnetized for ~5 My at ~200°C [Fig. 2B;
(16, 19)], which is the estimated metamorphic
temperature for Semarkona on the LL parent
body (16, 25). (v) Magnetic field maps of dusty
olivine-bearing chondrules confirm that the HC
magnetization is carried by dusty olivines, which
formed in the nebula and are expected to retain
preaccretional magnetization as outlined above.
(vi) The magnetization direction acquired during
cooling for a spinning chondrule is expected
to be parallel to its rotation axis (29). The close
alignment between the HC directions and the
short axes of our chondrules, which are likely
related to the rotation axis (supplementary text),
is nonrandom at the 98% confidence level,
suggesting that HC/HT magnetizations were indeed
acquired parallel to the spin axis. (vii) The
coercivity spectrum of the HC component of
dusty olivine-bearing chondrules is very similar
to that of an ARM and dissimilar to that of an
IRM, which suggests that the HC component was
acquired as a TRM (30). We therefore conclude
with high confidence that the HC/HT magnetizations
observed in dusty olivine-bearing chondrules
are TRMs acquired in the solar nebula.

TRM acquisition experiments on analogs of
dusty olivine chondrules have shown that the
ARM normalization method potentially produces
paleointensities accurate to ~40% [2σ; (30)].
Assuming that the five chondrules with recovered
paleointensities formed in similar magnetic field
conditions in the nebula, our nominal ARM
paleointensities for six dusty olivines yielded a
mean value of 27 µT. The morphology of chondrules
(31) and the apparent correspondence
between HC magnetization directions and the
short axes of our chondrule samples (see above
paragraph) strongly suggest that chondrules were
rotating during remanence acquisition in the solar
nebula. In the case of rotation around a single
axis, which is the expected motion inherited from
cooling of a viscous droplet, the true mean field
intensity should have been greater than our
experimental paleointensity by a factor of 2
(supplementary text). Meanwhile, precession of the
chondrule’s rotation axis would imply a multiplicative
factor of up to 4. However, given the high
inferred rotation rates of chondrules (>50 s⁻¹),
the magnitudes of effects that may lead to precession
are comparatively small. Therefore, we
adopt nonprecessing rotation as the most likely
state of cooling chondrules and recommend a
value of 54 ± 21 µT (2σ) for the ambient nebular
field strength. Although unlikely, if chondrules
did not rotate or precessed strongly during remanence
acquisition, the corresponding estimated
ambient field strength would be 27 ± 8 µT or
108 ± 42 µT (2σ), respectively.
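
The rotation corrections described above amount to rescaling the nominal paleointensity by a multiplicative factor. The following is a minimal Python sketch of that arithmetic, using only the mean value (27 µT) and the factors (1, 2, and up to 4) quoted in the text. The dictionary labels and function name are illustrative, and the quoted 2σ uncertainties are deliberately not reproduced, since their derivation is given only in the supplementary text.

```python
# Nominal mean ARM paleointensity in microtesla, as quoted in the text.
NOMINAL_PALEOINTENSITY_UT = 27.0

# Multiplicative corrections quoted in the text; the labels are illustrative.
# The precession factor is an upper bound ("up to 4").
ROTATION_FACTORS = {
    "non-rotating": 1.0,
    "single-axis rotation": 2.0,
    "strong precession": 4.0,
}


def ambient_field_ut(nominal_ut, factor):
    """Scale the nominal paleointensity by a rotation correction factor."""
    return nominal_ut * factor


estimates = {
    state: ambient_field_ut(NOMINAL_PALEOINTENSITY_UT, factor)
    for state, factor in ROTATION_FACTORS.items()
}

for state, value in estimates.items():
    print(f"{state}: {value:.0f} µT")
```

The single-axis case corresponds to the recommended central value, and the other two cases bracket it.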

Our paleointensity constrains the magnetic 
field environment during the last time the chon- 
drule cooled through the 765° to 350°C blocking 
temperature range of the HC/HT component, 
which likely was the chondrule-forming event. 
Our recommended paleointensity is significantly
lower than the >80 to 400 µT expected for
chondrules purportedly formed in the x-wind
model (10). Furthermore, mechanisms that invoke
intense electric currents such as magnetic
reconnection flares and current sheets predict
strong fields >500 µT during chondrule heating
(32). The short-circuit instability may also imply
similarly strong fields at high temperature (33),
although the decay of field strength below 765°C
has not been studied in detail. In contrast, assuming
that magnetic fields inherited from the collapsing
molecular cloud were on the order of
10 µT (12), nebular shocks, which may enhance
the ambient magnetic field by a factor of <10,
would result in paleointensities of <100 µT (11).
Meanwhile, planetesimal collisions would likely 
not perturb the background field. Therefore, 
nebular shocks and planetesimal collisions are 
the chondrule formation models most consist- 
ent with our measured paleointensities. 
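
The comparison in this paragraph can be illustrated as a simple interval-overlap check. The sketch below is a rough illustration and not the authors' statistical treatment: it assumes the recommended value of 54 ± 21 µT can be treated as a closed interval, reads the ">80 to 400 µT" x-wind prediction as a lower bound of 80 µT, and uses hypothetical names throughout. Planetesimal collisions are omitted because the text gives no numerical range for them beyond an unperturbed background field.

```python
import math

# Recommended ambient field and 2-sigma uncertainty in microtesla (from the text).
RECOMMENDED_UT = 54.0
TWO_SIGMA_UT = 21.0

# Predicted paleointensity ranges in microtesla as (lower, upper) bounds.
# Open-ended predictions use infinity for the missing bound (an assumption).
PREDICTED_RANGES_UT = {
    "x-wind": (80.0, math.inf),
    "current sheets and reconnection flares": (500.0, math.inf),
    "nebular shocks": (0.0, 100.0),
}


def overlaps(measured, predicted):
    """Return True if two closed intervals share at least one value."""
    return measured[0] <= predicted[1] and predicted[0] <= measured[1]


measured_interval = (RECOMMENDED_UT - TWO_SIGMA_UT, RECOMMENDED_UT + TWO_SIGMA_UT)
consistency = {
    model: overlaps(measured_interval, bounds)
    for model, bounds in PREDICTED_RANGES_UT.items()
}

for model, ok in consistency.items():
    print(f"{model}: {'consistent' if ok else 'inconsistent'}")
```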

Adopting nebular shocks and planetesimal 
collisions as the most likely origins of chon- 
drules, background magnetic fields in the neb- 
ula may have been amplified by a factor between 
1 and 10 during chondrule formation. We there- 
fore infer that background magnetic fields in the 
solar nebula were between 5 and 54 µT (11).
Assuming Semarkona chondrules formed near
2.5 AU, which is the present-day location of
S-type asteroids (34), the vertical distribution
of dust in the nebula strongly suggests that 

28 NOVEMBER 2014 • VOL 346 ISSUE 6213 1091 


Fig. 3. Magnetization directions in Semarkona chondrules and bulk samples.
Equal area stereonet projection diagram where colored points denote high
coercivity or high temperature (HC/HT) magnetization in individual dusty
olivine-bearing samples or subsamples. Circles indicate the associated
maximum angular deviation (MAD) obtained from principal component analysis.
Each color represents a single chondrule, with chondrules DOC3 and DOC4
having two subsamples each. Black stars and associated ovals represent the
mean directions of the three postaccretional overprints identified from bulk
samples and their 95% confidence intervals. Open symbols represent the
upper hemisphere; solid symbols represent the lower hemisphere. The wide scatter of the HC/HT
magnetizations, the unidirectionality of subsamples from DOC3 and DOC4, and their noncorrespondence
to the directions of postaccretional overprints provide strong evidence for a preaccretional origin
of the HC/HT magnetizations in dusty olivine-bearing chondrules.
chondrule formation took place in the weakly
ionized “dead zone,” which contains gas poorly
coupled to local magnetic fields and occurs within
~3 gas scale heights of the midplane (33). Our
measurements therefore indicate that a substantial
magnetic field [yet still well below the ~400 µT
equipartition field strength (3)] existed in the dead
zone, potentially as a result of fields inherited from
the collapse of the solar system’s parent molecular
cloud. Given our measured field strengths, mass
accretion driven by the MRI or magnetic braking at
2.5 AU would have been <0.04 x 10“® to 3.5 x 10“® 
JWsun year“\ where is the Sun’s mass (sup- 
plementary text). Meanwhile, the MCW model 
would predict mass accretion rates of 0.3 x 10“^ 
to 30 X lO”"^ Msim year”' or less. The inferred age of 
Semarkona chondrules is 2 to 3 My after the first
calcium-aluminum-rich inclusions (36). Given that
protoplanetary disks are observed to have ac- 
cretion rates of 10“®-10“'^ Mjun year”' at 2 to 3 My 
after collapse of their parent molecular clouds (2), 
both magnetic mechanisms could fully account for
the expected accretion rates. This suggests that 
magnetic fields govern the observed rapid trans- 
formation of protoplanetary disks into planetary 
systems around Sun-like stars. 
Dynamic mechanical behavior of
multilayer graphene via supersonic
projectile penetration

Jae-Hwang Lee,* Phillip E. Loya, Jun Lou, Edwin L. Thomas*

Multilayer graphene is an exceptional anisotropic material due to its layered structure 
composed of two-dimensional carbon lattices. Although the intrinsic mechanical 
properties of graphene have been investigated at quasi-static conditions, its 
behavior under extreme dynamic conditions has not yet been studied. We report the 
high-strain-rate behavior of multilayer graphene over a range of thicknesses from 
10 to 100 nanometers by using miniaturized ballistic tests. Tensile stretching of the 
membrane into a cone shape is followed by initiation of radial cracks that approximately 
follow crystallographic directions and extend outward well beyond the impact area. 

The specific penetration energy for multilayer graphene is ~10 times more than literature
values for macroscopic steel sheets at 600 meters per second. 


Graphene, the atomic monolayer building
block of graphite, is known for its exceptionally
high intrinsic strength and stiffness
arising from the two-dimensional (2D)
hexagonal lattice of covalently bonded
carbon atoms. Recently, graphene’s in-plane
Young’s modulus ($Y_\parallel$) was measured to be more
than 1.0 TPa using atomic force microscope
nanoindentation (1). Because tensile mechanical
stresses in a material cannot be transmitted
faster than the speed of sound [$c \sim (Y/\rho)^{1/2}$, where
$\rho$ is the density of the material], the nonequilibrium
local stress arising from the inertial effect
becomes important under dynamic conditions
accompanying high-strain-rate, predominantly
tensile loading (2). In this regard, the relatively
low density (~2200 kg m⁻³) of graphene (3),
along with its high modulus, leads to a superior
in-plane speed of sound ($c_\parallel \sim 22.2$ km s⁻¹),
implying that concentrated stresses applied under
extreme conditions can rapidly be delocalized.
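
The quoted in-plane speed of sound follows from the relation c ~ (Y/ρ)^(1/2). As a minimal sketch of that arithmetic in Python (function and variable names are illustrative): a modulus of exactly 1.0 TPa with a density of 2200 kg m⁻³ gives about 21.3 km s⁻¹, so the quoted 22.2 km s⁻¹ corresponds to a modulus slightly above 1.0 TPa, consistent with "more than 1.0 TPa" in the text.

```python
import math


def speed_of_sound(youngs_modulus_pa, density_kg_m3):
    """Longitudinal wave speed estimate c = sqrt(Y / rho), in m/s."""
    return math.sqrt(youngs_modulus_pa / density_kg_m3)


GRAPHENE_DENSITY = 2200.0  # kg m^-3, approximate value quoted in the text

# Speed of sound for a modulus of exactly 1.0 TPa.
c_at_1tpa = speed_of_sound(1.0e12, GRAPHENE_DENSITY)

# Modulus implied by the quoted in-plane speed of sound of 22.2 km/s.
implied_modulus_pa = GRAPHENE_DENSITY * (22.2e3) ** 2

print(f"c at 1.0 TPa: {c_at_1tpa / 1e3:.1f} km/s")
print(f"modulus implied by 22.2 km/s: {implied_modulus_pa / 1e12:.2f} TPa")
```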

Nanoindentation has served as an effective 
technique to study the tensile mechanical prop- 
erties of monolayer graphene. It is inherently 
a low-speed test (≪1 m s⁻¹), but strain rates
can reach -10^ to 10® s”' for very thin samples 
(4), whereas most high-speed, high-strain-rate 
mechanical characterization techniques, such as 
split-Hopkinson pressure bar (5) and ballistic 
tests (6), are inappropriate for testing very thin 
specimens. To address high-speed and high- 
strain-rate tensile-dominated penetration of thin 
films, we improved our laser-induced projectile 
impact test (LIPIT) (7). In this advanced LIPIT 
(or “a-LIPIT”), a single micrometer-size solid
silica sphere (or “µ-bullet”) is fired at a high speed
(<3 km s⁻¹) with a high aiming accuracy (<1.1°
deflection) toward a thin film. The velocity of
the µ-bullet is measured before and after penetration
to determine the energy lost during the
test. We employed multilayer graphene (MLG)
membranes in a range of thicknesses (10 to
100 nm, equivalent to 30 to 300 graphene layers)
to apply localized, very-high-strain-rate tensile
deformation (~10⁷ s⁻¹) at a specific area. As the
thickness of the MLG membranes ($h$) is always
considerably less than the diameter ($D$) of a
µ-bullet ($D/h > 40$), the high-strain-rate in-plane
tensile behavior of MLG can be assessed from
the thickness-dependent characteristics of the
energy required for projectile penetration through
the membrane.

The MLG membranes are prepared by mechanical
exfoliation of highly ordered pyrolytic
graphite (grade SPI-1, SPI Supplies; fig. S1), as
shown in Fig. 1A. A silica µ-bullet [$D$ = 3.7 ± 0.02 µm,
based on imaging by scanning electron microscopy
(SEM)] is propelled by expanding gases created
by the laser ablation of a gold film (~50 nm
thick). A 20-µm-thick elastomeric layer of cross-linked
polydimethylsiloxane confines
the ablation products, eliminates the temperature
rise of the µ-bullet, and diminishes the strength
of the shock waves propagating through air.
The impact speed ($v_i$) is measured (with the
reproducibility of $v_i$ within ±2%) using a triple-exposure
photograph of the moving µ-bullet
with ±10 m s⁻¹ error, where the time gap between
the three exposures is achieved by employing
different travel distances for each laser
pulse (fig. S2). The residual speed ($v_r$) of the
µ-bullet is similarly measured after the µ-bullet
has traveled a certain distance ($d$) beyond the
membrane (Fig. 1B). An average thickness ($h_{ave}$)
of the impact area was determined using a
thickness-dependent optical transmittance measurement
(fig. S3) given by an analytical formula
(8) and the optical parameters of graphite (9).
An additional postpenetration image allowed
identification of the penetration area (Fig. 1C).

The schematic in Fig. 1D depicts the series of
events during penetration accompanying the kinetic
energy loss of a µ-bullet ($\Delta E_k$). As the
µ-bullet impacts a strike face area ($A_s = \pi D^2/4$),
an elastic wave radially propagates at $c_\parallel$ and a
conic deformation of the MLG membrane follows
with a radial speed of its base, $v_c$, which is
generally slower than $c_\parallel$ (step ii). The primary
force is axisymmetric tension with a strong radial
gradient and results in the cone shape.
Typically, three to six cracks are initiated near
the center of $A_s$ and propagate outward in the
radial direction (step iii), resulting in the creation
of the same number of petals. The transferred
momentum to the MLG membrane induces
creasing and folding of each triangular-shaped
petal at its base while the elastic extension of the
membrane is rapidly relaxed along the radial
direction via snap-back (step iv). The longest crack
from the impact center is defined as the maximum
crack distance, $L_{max}$, which we use as the
estimation of the final radius of the conic deformation
due to the reduction of the tangential
stress. $\Delta E_k$ is composed of two terms, the net energy
to penetrate a membrane ($E_p$) and the energy
loss due to air drag ($E_{air}$). $E_p$ depends on various
energy dissipation mechanisms including elastic
stretching of the membrane, fracture, and heating,
as well as the kinetic energy transfer to the
membrane petals and membrane debris.

$$\Delta E_k = \tfrac{1}{2} m \left( v_i^2 - v_r^2 \right) = E_p + E_{air}(d) \qquad (1)$$

The mass of a µ-bullet, $m$, is calculated to be
(5.0 ± 0.1) × 10⁻¹⁴ kg, based on the measured diameter
$D$ and the density of silica (1900 kg m⁻³)
provided by its vendor (microParticles GmbH).
The incident $E_k$ is 9 nJ at 600 m s⁻¹ and 21 nJ at
900 m s⁻¹, while $\Delta E_k$ is in the range of 1 to 5 nJ.
The measured deceleration by air drag ($a_{air}$) is
~0.6 × 10⁸ m s⁻², assuming constant deceleration,
which yields $E_{air} = m a_{air} d \sim 1.07$ nJ for $d \sim$
350 µm (the travel distance of a µ-bullet after
penetration when we take the triple exposure).
Therefore, the primary contribution to the kinetic
energy loss is the net energy to penetrate
a MLG membrane ($E_p$) under our experimental
conditions.
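
The energy budget in this paragraph can be checked with a few lines of arithmetic. The Python sketch below assumes a solid sphere of diameter 3.7 µm and density 1900 kg m⁻³, and an air-drag deceleration of about 0.6 × 10⁸ m s⁻² (the exponent is inferred here from the quoted 1.07 nJ loss over 350 µm, not read directly from the text). All names are illustrative, and the results agree with the quoted values only to within rounding.

```python
import math

DIAMETER_M = 3.7e-6            # µ-bullet diameter from SEM imaging
SILICA_DENSITY = 1900.0        # kg m^-3, vendor value quoted in the text
AIR_DECELERATION = 0.6e8       # m s^-2, magnitude; exponent inferred (assumption)
TRAVEL_DISTANCE_M = 350e-6     # distance travelled after penetration


def sphere_mass(diameter_m, density_kg_m3):
    """Mass of a solid sphere: rho * pi * D^3 / 6."""
    return density_kg_m3 * math.pi * diameter_m ** 3 / 6.0


def kinetic_energy(mass_kg, speed_m_s):
    """Translational kinetic energy in joules."""
    return 0.5 * mass_kg * speed_m_s ** 2


def air_drag_loss(mass_kg, deceleration_m_s2, distance_m):
    """Energy lost to a constant decelerating force over a distance."""
    return mass_kg * deceleration_m_s2 * distance_m


mass = sphere_mass(DIAMETER_M, SILICA_DENSITY)
ke_600_nj = kinetic_energy(mass, 600.0) * 1e9
ke_900_nj = kinetic_energy(mass, 900.0) * 1e9
e_air_nj = air_drag_loss(mass, AIR_DECELERATION, TRAVEL_DISTANCE_M) * 1e9

print(f"mass: {mass:.2e} kg")
print(f"incident energy at 600 m/s: {ke_600_nj:.1f} nJ")
print(f"incident energy at 900 m/s: {ke_900_nj:.1f} nJ")
print(f"air-drag loss over 350 µm: {e_air_nj:.2f} nJ")
```

Because the air-drag loss is about 1 nJ while the total loss ranges from 1 to 5 nJ, the penetration term dominates for all but the thinnest membranes.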

The MLG membranes had a typical lateral
grain size of ~10 µm. Due to the stress concentration
at the impact site, the grain boundary
effects (10, 11) (if any; see the supplementary
materials) would occur only if the grain boundaries
exist within $A_s$. As the ratio $D/h$ is quite large
(40 to 350), the mechanical response of the MLG
film depends primarily on its in-plane tensile
strength under a high strain rate. A typical penetration
hole features a set of petals (Fig. 2). The
area directly beneath the µ-bullet impact shows
extensive damage through complex, fine-scale
fractures, folding, delamination, and loss of parts
of the membrane (indicated by the yellow arrowheads).
As the initiation of the radial cracks
may not be exactly at the impact center, asymmetric
shaped petals are often observed (e.g., Fig. 2, A
and D). The damage area is thus much wider than
$A_s$, in strong contrast to the observed behavior of
polycrystalline gold and amorphous, glassy
poly(methyl methacrylate) (PMMA) membranes,



Fig. 1. The microballistic experiment. (A) Scheme of the experiment. PDMS, polydimethylsiloxane. (B)
Side-view image of a moving µ-bullet taken by triple exposure at time steps t1 to t3. (C) MLG membrane
on a sample holder after a-LIPIT. Three separate impact test regions are highlighted by green backlight.
(D) Schematic illustration of penetration steps: (i) prepenetration stage; (ii) conic deformation stage; (iii)
fracture stage; and (iv) postpenetration stage, showing the film morphology after penetration and
relaxation. Scale bars in (B) and (C), 50 µm. Figure labels: highly deformed tip; radially propagating cracks.
in which penetration results in a circular hole with
an area of $A_s$ (see figs. S6 and S7). Correspondingly,
a much smaller penetration energy is measured.

Many independent penetration experiments
(47 events for $v_i$ = 600 m s⁻¹ and 43 events for $v_i$ =
900 m s⁻¹) were carried out for statistical analysis
(figs. S4 and S5). The average apex angles ($\theta_a$ in
Fig. 1D) of petals for the 600- and 900-m s⁻¹
projectile velocities are 83 ± 21° (for 107 cracks)
and 70 ± 23° (for 93 cracks), respectively, indicating
that the higher tangential stresses induced
at 900 m s⁻¹ were relaxed through more radial
cracks. Despite the in-plane isotropic elastic nature
due to the approximate sixfold symmetry of
graphene (12), the preferential crack propagation



Fig. 2. Representative penetration features of MLG membranes. (A and B) SEM images of petals, radial cracks, folds, and snap-back damage to the petal
tips and (C) the adjacent crack-pair angle distribution for $v_i$ = 600 m s⁻¹. (D and E) SEM images and (F) the adjacent crack-pair angle distribution for $v_i$ = 900 m s⁻¹.
The inset in (F) shows the armchair (red) and zigzag (green) directions. The circles in the SEM images show $A_s$. Scale bars in (A), (B), (D), and (E), 5 µm.
[Horizontal axis in (C) and (F): angle between adjacent cracks (°), 0 to 150.]


Fig. 3. Damage features of a thin MLG membrane. (A) Bright-field TEM
micrograph of the impact region ($h_{ave}$ ~ 10 nm) for $v_i$ = 900 m s⁻¹. (B)
Higher magnification of the petal apex area indicated by the red
arrowhead in (A). (C) Three dark-field TEM micrographs are overlaid to show
bend contours, rotation-tilt moiré fringes, and a fine-scale mosaic
texture resulting from snap-back and membrane folding. The two diffraction
patterns show (C1) a typical hexagonal spot pattern of the undeformed film and
(C2) the altered, multireflection-satellite spot pattern from near the impact region. The red and blue regions correspond to imaging with $g = (01\bar{1}0)$, and
the green region corresponds to $g = (2\bar{1}\bar{1}0)$. Scale bars, (A) 5 µm; (B) and (C), 0.5 µm. The voidlike features at the upper left in (A) are a result of residual
water-soluble polymer from film preparation.
directions $\langle 1\bar{1}00 \rangle$ (the armchair direction) and
$\langle 2\bar{1}\bar{1}0 \rangle$ (the zigzag direction) cause preferred angles
between adjacent cracks to be a multiple of
30° (13). Correlation to the underlying crystallographic
orientation of the membrane is noted in
the distribution of the angle between adjacent
cracks displaying preferences for small multiples
of 30° (Fig. 2, C and F).

Transmission electron microscopy (TEM) was 
carried out on snap-back portions of the petal 
regions of thin ($h_{ave}$ ~ 10 nm) fractured membranes
(Fig. 3). The bright-field TEM image shows
complicated local folding of the membrane near 
the penetration (Fig. 3, A and B). The higher- 
magnification dark-field image shows bend 
contours and moire fringes (upper left and left 
center regions in Fig. 3C, resulting from the 
interference of electrons scattered from super- 
posed folded MLG regions; see diffraction pat- 
tern insets in Fig. 3C). In the extensively folded 
region nearest the impact origin, a fine-scale 
mosaic structure is evident in both the electron 
diffraction pattern and dark-field images due to 
the deformation resulting from the rapid elastic 
relaxation (i.e., petal snap-back). 

Despite a relatively wide fluctuation in $L_{max}$,
its lower limit is well fit by $0.1\,h_{ave} + D/2$ (Fig. 4A).
The film thickness inhomogeneity is represented
by the coefficient of variation (CV) of the local
thicknesses measured over a circular area (radius
$r = L_{max}$). MLG membranes that have CV > 10%
in the impact area clearly lead to a greater fluctuation
of $L_{max}$. The radial speed of the circumferential
base of the expanding cone-shape deformation
region of a membrane can be approximated by
$v_c \approx 1.23\,c_\parallel [v_i/(\sqrt{2}\,c_\parallel)]^{2/3}$ and thus scales with the
cube root of the in-plane speed of sound in the
material (14). Values for $v_c$ correspond to 1950
and 2560 m s⁻¹ for impacts of 600 and 900 m s⁻¹,
respectively. An empirical estimation of deformation
parameters is then possible by setting the lower
limit of $L_{max}$ to the maximum radius of the cone,
because the tangential tensile stress (Fig. 1D) is
the origin of the radial cracks. For example, assuming
a simple 1D model for a 900-m s⁻¹ impact to a
50-nm-thick MLG membrane, we estimate the
penetration time $t_p \approx L_{max}/v_c \sim 3$ ns; the 1D-approximate
average maximum tensile strain
$\varepsilon_{max} \approx (v_i t_p/L_{max})^2/2 \sim 6\%$, close to the lower
boundary of the reported failure strain range, 5
to 25% (1, 15, 16); and the 1D-approximate average
tensile strain rate as given by the maximum
strain divided by the penetration time, or
$\dot{\varepsilon}_{max} \approx (v_i/L_{max})^2 t_p/2 \sim 10^7$ s⁻¹. A further discussion
of the estimation of strain and strain rate
is available in the supplementary materials.
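
The estimates in this paragraph can be reproduced numerically under two stated assumptions. First, the cone-speed relation is taken to be v_c ≈ 1.23 c∥ [v_i/(√2 c∥)]^(2/3), a reading that reproduces the quoted 1950 and 2560 m s⁻¹. Second, the lower limit of L_max is taken to be 0.1 h_ave + D/2 with h_ave in nanometers and the result in micrometers (the units are an assumption). The Python sketch below uses illustrative names and is not the authors' own analysis.

```python
import math

C_PARALLEL = 22.2e3   # in-plane speed of sound in m/s, as quoted in the text
D_UM = 3.7            # µ-bullet diameter in micrometers


def cone_speed(v_i, c=C_PARALLEL):
    """Radial speed of the cone base: 1.23 c [v_i / (sqrt(2) c)]^(2/3)."""
    return 1.23 * c * (v_i / (math.sqrt(2.0) * c)) ** (2.0 / 3.0)


def lower_limit_lmax_m(h_ave_nm, d_um=D_UM):
    """Assumed lower limit of L_max: 0.1 h_ave (nm) + D/2 (µm), returned in meters."""
    return (0.1 * h_ave_nm + d_um / 2.0) * 1e-6


v_c_600 = cone_speed(600.0)
v_c_900 = cone_speed(900.0)

# Worked example from the text: 900 m/s impact on a 50-nm-thick membrane.
v_i = 900.0
l_max = lower_limit_lmax_m(50.0)
t_p = l_max / v_c_900                      # penetration time in seconds
strain = 0.5 * (v_i * t_p / l_max) ** 2    # 1D average maximum tensile strain
strain_rate = strain / t_p                 # average tensile strain rate in 1/s

print(f"v_c: {v_c_600:.0f} m/s (600 m/s), {v_c_900:.0f} m/s (900 m/s)")
print(f"t_p: {t_p * 1e9:.1f} ns, strain: {strain * 100:.1f}%, rate: {strain_rate:.1e} 1/s")
```

Note that the strain reduces to (v_i/v_c)²/2 in this model, so it depends on the impact speed but not on the assumed L_max.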

From Eq. 1, $\Delta E_k(h_{ave})$ can be fit with a linear
function (Fig. 4B), where the $y$-intercept value
corresponds to $E_{air}$. Therefore, the intrinsic energy
dissipation of a MLG membrane is given by
$E_p(h_{ave}) = 0.026\,h_{ave}$ and $0.030\,h_{ave}$ for the two
velocities we employed, and a similar trend is
also found in macroscopic ballistic tests (17, 18).
For $D/h_{ave} \gg 1$, $E_p$ can be expressed by two terms,
$E_p = (\rho A_s h_{ave}) v_i^2/2 + E_d$, where the first term
represents the minimum inelastic energy transfer
to target material within $A_s$ and $E_d$ represents
all of the other energy dissipation mechanisms.
The specific energy dissipation, $E_p^* = E_p (\rho A_s h_{ave})^{-1}$,
which is insensitive to material density by taking
account of the mass within $A_s h_{ave}$, is given by
$E_p^* = v_i^2/2 + E_d^*$, where $E_d^*$ is the specific delocalized
penetration energy. $E_d^*$ is thus a figure
of merit to evaluate the impact energy delocalization
ability of a material as more sample mass
beyond $A_s$ contributes to the energy dissipation,
whereas the material-independent energy
dissipation term, $v_i^2/2$, serves as a baseline.
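
As a worked illustration of these definitions, the Python sketch below converts the fitted slopes (0.026 and 0.030) into specific energies. It assumes the slopes are in nJ per nm of thickness (numerically equal to J per m), a graphene density of 2200 kg m⁻³, and a µ-bullet diameter of 3.7 µm; the unit assumption and all names are illustrative. Under these assumptions the results come out close to the specific energies reported for MLG elsewhere in the text.

```python
import math

GRAPHENE_DENSITY = 2200.0     # kg m^-3
DIAMETER_M = 3.7e-6           # µ-bullet diameter
STRIKE_AREA = math.pi * DIAMETER_M ** 2 / 4.0   # A_s = pi D^2 / 4, in m^2

# Fitted slopes of E_p versus h_ave, keyed by impact speed in m/s.
# Units assumed to be nJ per nm, which equals J per m.
SLOPES_J_PER_M = {600.0: 0.026, 900.0: 0.030}


def specific_penetration_energy(slope_j_per_m, density=GRAPHENE_DENSITY, area=STRIKE_AREA):
    """E_p* = E_p / (rho A_s h_ave); with E_p = slope * h_ave the thickness cancels."""
    return slope_j_per_m / (density * area)


def specific_delocalized_energy(e_p_star, v_i):
    """E_d* = E_p* - v_i^2 / 2, in J/kg."""
    return e_p_star - 0.5 * v_i ** 2


results_mj_per_kg = {}
for v_i, slope in SLOPES_J_PER_M.items():
    e_p_star = specific_penetration_energy(slope)
    e_d_star = specific_delocalized_energy(e_p_star, v_i)
    results_mj_per_kg[v_i] = (e_p_star / 1e6, e_d_star / 1e6)
    print(f"{v_i:.0f} m/s: E_p* = {e_p_star / 1e6:.2f} MJ/kg, E_d* = {e_d_star / 1e6:.2f} MJ/kg")
```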


Statistical values of $E_p h^{-1}$ for MLG, PMMA,
and polycrystalline gold were determined from
the fitted slopes of $E_p$ versus $h_{ave}$ (Fig. 4B and
figs. S6E and S7C). From these data (see square
data points in Fig. 4C), MLG exhibits the highest
$E_p^*$ (or $E_d^*$), namely 1.26 MJ kg⁻¹ (or 0.86 MJ kg⁻¹)
at 900 m s⁻¹, compared with 0.58 MJ kg⁻¹ (or
0.19 MJ kg⁻¹) for gold and 0.52 MJ kg⁻¹ (or
0.08 MJ kg⁻¹) for PMMA. We also calculated
$E_p^*$ from previous macroscopic ballistic tests of
several materials: PMMA (19), aluminum (20, 21),
steel (22, 23), and Kevlar KM2-polyvinyl butyral
(PVB) composite fabric (24). These macroscopic
tests used a millimeter-scale spherical steel projectile
to penetrate a thin sheet (0.4 < $h$ < 6 and
1 < $D/h$ < 20) without appreciable deformation of
the projectile (table S1). The overall trend of $E_p^*$ is
quite similar, despite the large differences in the
microscopic a-LIPIT and traditional macroscopic
tests and the huge range in tensile modulus, strength,
and density among PMMA, aluminum, and steel.
This implies that the principal energy dissipation
mechanism for the three macroscopic materials is
the kinetic energy transfer from the projectile to
the target material within $A_s$ (i.e., to the mass
$\rho A_s h$), which results in a dishing process (25). This
is why the $E_p^*$ values of PMMA and gold from
a-LIPIT also follow this trend and indicate a good
correspondence between the micro- and macroscopic
high-strain-rate evaluation methods for
these materials, which also display localized penetration
via a dishing process.

However, the two highly anisotropic layered 
materials— the Kevlar KM2-PVB composite fab- 
ric and the MLG membrane— deviate strongly 
from this trend. The Kevlar KM2-PVB composite 
is an armor-grade laminate made of strong, stiff 
polyaramid fibers (strength ~ 4 GPa, $Y$ ~ 84 GPa,
$c$ ~ 7.6 km s⁻¹) embedded in PVB resin (26). The
Fig. 4. Analyses of microballistic results. (A) Maximum crack distances in MLG membranes of various thicknesses for the two different penetrator
velocities. (B) Kinetic energy changes of a µ-bullet versus thickness after penetration of MLG membranes. Colors in (A) and (B) represent the thickness
inhomogeneity (via CV) of the film in the impact area. (C) Specific penetration energy of MLG, PMMA, and gold membranes compared with macroscopic
materials at various impact velocities. The density of each material is represented by a logarithmic color scale. Error bars denote SD.
planar fourfold symmetric high-stiffness woven
polyaramid multilayer fabric shows an extensive
conelike deformation under ballistic impact, and
the higher $E_p^*$ (or $E_d^*$) values (see blue triangles
in Fig. 4C) can be understood by the contribution
of large sample mass well beyond the impact
area to energy absorption (27). Similarly,
the superior value of $E_d^*$ for MLG (see green
squares in Fig. 4C) can be explained by its ability
to simultaneously be stiff, strong, and elastic,
stretching into a cone shape due to the force
imparted by the forward moving projectile. As
a result, the $E_d^*$ values of the MLG membrane
(0.92 and 0.86 MJ kg⁻¹ for 600 and 900 m s⁻¹)
substantially surpass those of steel (0.08 and
0.11 MJ kg⁻¹) at the same $v_i$. Therefore, the MLG
demonstrates specific delocalized penetration
energy 8 to 12 times higher than that of steel
due to the strong delocalization behavior at these
impact speeds. As the higher delocalization effect
results in a wider penetration hole, this tendency
could be disadvantageous in certain aspects such
as multi-hit capability. However, this potential
weakness of MLG will be substantially relieved
when the crack propagation is deflected by forming
a composite.
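
The "8 to 12 times" comparison follows directly from the quoted specific delocalized penetration energies. A minimal Python check, using only the four values given in the text (variable names are illustrative):

```python
# Specific delocalized penetration energies in MJ/kg, keyed by impact speed in m/s.
MLG_E_D_STAR = {600: 0.92, 900: 0.86}
STEEL_E_D_STAR = {600: 0.08, 900: 0.11}

# Ratio of MLG to steel at each impact speed.
ratios = {speed: MLG_E_D_STAR[speed] / STEEL_E_D_STAR[speed] for speed in MLG_E_D_STAR}

for speed, ratio in ratios.items():
    print(f"{speed} m/s: MLG / steel = {ratio:.1f}")
```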

Our microscopic ballistic results reveal that the 
superior in-plane speed of sound, high strength,
stiffness, and structural anisotropy make MLG an 
extraordinary armor material exhibiting excellent 
impact energy delocalization under a supersonic 
penetration event. Because material far beyond 
the strike face area can also consume kinetic energy
from the µ-bullet while the MLG membrane
sustains high dynamic tensile stress, the µ-bullet
effectively experiences a higher areal density material.
As large-scale production of graphene-based
composite materials is becoming possible (28), 
other graphene-like materials are being studied 
(29), and the results suggest opportunities for the 
use of ordered anisotropic nanocomposites for sur- 
prising mechanical applications. The good corre- 
spondence between the micro- and macroscopic 
projectile penetration tests, especially in the mea- 
sured specific energy absorption, suggests that 
the microballistic method with its high-energy 
resolution may offer an effective means for the 
exploration of high-strain-rate physics of vari- 
ous materials, as well as practical advantages in 
rapid, high-throughput testing. 
REPELLENT SURFACES

Turning a surface superrepellent 
even to completely wetting liquids 

Tingyi “Leo” Liu and Chang-Jin “CJ”


Superhydrophobic and superoleophobic surfaces have so far been made by roughening a 
hydrophobic material. However, no surfaces were able to repel extremely-low-energy 
liquids such as fluorinated solvents, which completely wet even the most hydrophobic 
material. We show how roughness alone, if made of a specific doubly reentrant structure 
that enables very low liquid-solid contact fraction, can render the surface of any material 
superrepellent. Starting from a completely wettable material (silica), we micro- and 
nanostructure its surface to make it superomniphobic and bounce off all available liquids, 
including perfluorohexane. The same superomniphobicity is further confirmed with identical 
surfaces of a metal and a polymer. Free of any hydrophobic coating, the superomniphobic 
silica surface also withstands temperatures over 1000°C and resists biofouling. 


The ability to understand the extraordinary liquid repellency of
natural surfaces (1, 2) has affected a wide range of scientific and
technological areas, from coatings (3), heat transfer (4), and drag
reduction (5) to biomimetics (6). Whereas the wetting-resistant
surfaces developed since the 1960s (7-10) used only surface roughness
to trap gas with no interest in the apparent contact angles,
superhydrophobic surfaces created since the late 1990s (1, 11, 12)
combined the roughness with a hydrophobic material to superrepel
water, that is, to display a very large apparent contact angle
(θ* > 150°) and a very small roll-off angle (θ_roll-off < 10°). For
low-energy liquids such as oils or organic solvents, a roughness with
overhanging topology was necessary to make the hydrophobic material
superoleophobic (13, 14), omniphobic (15), or superomniphobic
(16, 17). Despite the use of the prefix “omni-” (6, 15-18), however,
no natural or man-made surface has been reported to repel liquids of
extremely low surface tension or energy (i.e., γ < 15 mJ/m²), such as
fluorinated solvents, which completely wet existing materials
(10, 19-21). Departing from the prevailing approach of roughening a
hydrophobic material, we first propose that the material’s inherent
wettability, depicted by the intrinsic contact angle θ_Y, is
irrelevant when dealing with a completely wetting liquid (θ_Y = 0°).
Focusing instead solely on the roughness details, we develop a surface
that superrepels all available liquids, including fluorinated
solvents, for instance, perfluorohexane (C6F14; namely, 3M Fluorinert
FC-72), whose surface energy (γ = 10 mJ/m²) is the lowest known and
which has never been observed to bead up, let alone roll off, on any
surface.

To avoid the confusion with the petal effect (22), in which a droplet
with large contact angles sticks to the surface, it helps to first
clarify that “repelling” means that droplets not only bead on but also
roll off the surface. To repel (i.e., θ* > 90° with a small
θ_roll-off) or superrepel (θ* > 150° with θ_roll-off < 10°) a wetting
liquid (θ_Y < 90°) on a structured surface, two conditions must be met:




[Fig. 1 panel labels: (A) θ_Y ≈ 120°; (B) θ_Y ≈ 30°; (C) θ_Y ≈ 0°. Δp marks the liquid pressure in each panel.]


Fig. 1. Liquid suspension on surface structures of three different topologies. (A) Simple structures require θ_Y > 90° to suspend water. Δp is the pressure
difference between the liquid and air. (B) Reentrant structures allow θ_Y < 90° to suspend oils or solvents. They would fail if θ_Y ≈ 0°, as surface tension acts parallel to
the horizontal overhangs with little vertical component to suspend the liquid. (C) Doubly reentrant structures allow θ_Y ≈ 0° to suspend any liquid, as surface tension
acts on the vertical overhangs with a substantial vertical component. If the liquid-solid contact fraction is small enough, the surfaces would also repel the liquids.


Fig. 2. Relation between apparent contact angle θ* and solid fraction
f_s for ideal Cassie-state droplets with intrinsic contact angle θ_Y as
a parameter. As f_s decreases, the band of lines narrows, indicating
that the influence of θ_Y on θ* diminishes. If f_s is below 6%, θ* is
above 150° even if θ_Y = 0°. The green, red, and blue lines represent
the inherent wettability: nonwetting (e.g., water on a hydrophobic
surface), moderately wetting (e.g., solvent on a hydrophobic surface),
and highly wetting (e.g., fluorinated solvents on any surface or most
liquids on clean SiO2), respectively. The three bold lines correspond
to the three cases shown in Fig. 1.



(i) a successful liquid suspension on the roughness and (ii) a low
liquid-solid contact. The microstructures should first be able to
suspend the liquid, supporting a composite interface proposed by
Cassie and Baxter (23). Once suspended, decreasing the liquid-solid
contact would increase θ* and reduce θ_roll-off, hence increasing the
repellency. In the rare cases of a highly wetting (θ_Y < 10°) liquid
beading (i.e., successful suspension and θ* > 90°) on a structured
surface— for example, water on SiO2 (13, 24) or hexane on nickel
(If))— the liquid stuck on the surface rather than 
rolled off; these surfaces are not considered to 
repel the liquid, despite the beading. 

Liquid suspension by surface structures (or resisting liquid wetting
by surface topologies with characteristic length smaller than the
liquid’s capillary length) was proposed in the 1960s with θ_Y as a
critical parameter (8, 10). For θ_Y > 90°, as in water or aqueous
solutions on a hydrophobic material, a simple microstructure (Fig. 1A)
would suspend the liquid to a Cassie state (1, 2, 8, 10-12) (figs. S1
and S2). For θ_Y < 90°, as in oils and organic solvents on a
hydrophobic material or water on a slightly hydrophilic material, a
reentrant microstructure (Fig. 1B) is required to suspend the liquid
and keep it from wetting into the cavity (3, 8, 10, 13-18, 25). From
simple force balance, the reentrant topology of Fig. 1B would suspend
a liquid even with θ_Y ≈ 0° in the absence of any positive liquid
pressure. However, there is always a pressure in reality (e.g.,
hydrostatic, Laplace, or environmental perturbation), and once pushed
into the cavity the liquid spreads spontaneously. So far, the most
successful suspension was for liquids with surface energy as small as
γ ≈ 15 mJ/m² [i.e., pentane (15, 16) and isopentane (16)], leaving
many fluorinated solvents unresolved.

In addition to the reentrant microstructure, it has long been
hypothesized that surface structures of a doubly reentrant topology
(Fig. 1C) might provide a stronger resistance against wetting and
retain suspension, even if θ_Y ≈ 0° (8-10, 15, 26). The mechanism of
suspending such a completely wetting liquid on a doubly reentrant
microstructure is reasoned as follows (see Fig. 1C): Upon contacting
the surface, the liquid would wet the top surface and continue down
along the sidewall of the vertical overhangs. The liquid would stop
advancing at the bottom tip of the vertical overhangs, where the
surface tension can start to point upward. Though this concept of
suspending even highly wetting liquids on a doubly reentrant topology
has been known (8-10, 15, 26) and confirmed with water (24), the
liquid-solid contact fraction should be sufficiently low for the
resulting surface to not only suspend but also repel the liquid. A
highly wetting (θ_Y < 10°) liquid suspended on the microstructures
would still spread (i.e., θ* < 90°) on the composite surface unless
the liquid rests mostly on air. To understand how far we are from
being able to repel the highly wetting liquids, let us assess the
contribution of air to the repellency.

The apparent contact angle θ* for a suspended droplet (i.e., in a
Cassie state) is described by the Cassie-Baxter model (23) as

\[ \cos\theta^{*} = f_s \cos\theta_Y - f_g \qquad (1) \]

where f_s is the liquid-solid contact fraction [or “solid fraction”
for short; i.e., the proportion of liquid-solid contact area
(including the wetted regions inside the roughness) to the projected
area of the entire composite interface], f_g is the gas fraction
similarly defined for the liquid-vapor interface, and f_s + f_g ≥ 1
(27). If the liquid-solid and liquid-vapor interfaces are perfectly
flat, neglecting any solid roughness and meniscus curvature (that is,
the ideal Cassie state with f_s + f_g = 1), Eq. 1 simplifies to

\[ \cos\theta^{*} = f_s (1 + \cos\theta_Y) - 1 \qquad (2) \]

Although valid only for the ideal Cassie state, Eq. 2 allows us to
qualitatively explore the relation between θ*, f_s, and θ_Y. In
addition to the widely appreciated consequence that θ* can be greatly
increased as f_s decreases, we examine the role of the intrinsic
contact angle θ_Y by plotting Eq. 2 with θ_Y as a parameter in Fig. 2.
As is evident, the difference between the θ* values of a large θ_Y and
a small θ_Y decreases as f_s decreases. In other words, by minimizing
f_s, the contribution of the material’s inherent nonwettability
(described by the magnitude of θ_Y) to the liquid repellency
(described by the magnitude of θ*) diminishes. This diminishing trend
suggests that a structured surface may repel extremely wetting liquids
if f_s is very small. For example, even a completely wetting liquid
(θ_Y = 0°) may be superrepelled (θ* > 150°) if f_s < 6%. However, it
should not be forgotten that this argument is valid only for the
Cassie state (i.e., the suspended state), which is exceedingly
difficult to achieve if f_s becomes very small. Even for the reentrant
topology of Fig. 1B, the suspension force becomes too small before f_s
becomes small enough to repel liquids with very small θ_Y.
This difficulty explains why superrepellency
Fig. 3. Design and fabricated results of the SiO2 surface. (A)
Designed surface of microposts with doubly reentrant nano-overhangs.
As key geometric parameters, D is the post top diameter, P is the
center-to-center distance (i.e., pitch) between adjacent posts, and δ
and t are the length and thickness, respectively, of the vertical
overhang. To make f_s small enough (f_s < 6%), δ and t should be
shrunk to extreme values. (B to E) Scanning electron micrographs of
the fabricated surface. (B) Top-angled view of the square array of
circular posts with D ≈ 20 μm, P = 100 μm, δ ≈ 1.5 μm, and
t ≈ 300 nm, resulting in f_s ≈ 5%. (C) Bottom-angled view of one post.
(D) Cross-sectional view of one post. (E) Magnified cross-sectional
view of the vertical overhang. Note the similarity with the ideal topology of Fig. 1C.



has been shown for liquids with surface tensions 
above ~20 mN/m (13) but not for those with 
surface tension values of 15 to 20 mN/m, such 
as pentane (15, 16) and isopentane (16), which 
have been suspended but not repelled. 
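
As a rough numerical check of Eq. 2, the following sketch evaluates the ideal Cassie-state relation for a few intrinsic contact angles. The function names and the sample θ_Y values (120°, 30°, and 0°, chosen to mirror the three cases of Fig. 1) are illustrative assumptions and are not taken from the reported analysis.

```python
import math


def apparent_contact_angle(f_s, theta_y_deg):
    """Ideal Cassie state (Eq. 2): cos(theta*) = f_s * (1 + cos(theta_Y)) - 1.

    f_s is the solid fraction (0..1); angles are in degrees.
    """
    cos_star = f_s * (1.0 + math.cos(math.radians(theta_y_deg))) - 1.0
    return math.degrees(math.acos(cos_star))


def max_solid_fraction(theta_star_deg, theta_y_deg):
    """Largest f_s for which Eq. 2 still yields theta* >= theta_star_deg."""
    numerator = 1.0 + math.cos(math.radians(theta_star_deg))
    denominator = 1.0 + math.cos(math.radians(theta_y_deg))
    return numerator / denominator


if __name__ == "__main__":
    # Illustrative intrinsic angles: nonwetting, moderately wetting, fully wetting.
    for theta_y in (120.0, 30.0, 0.0):
        angle = apparent_contact_angle(0.05, theta_y)
        limit = max_solid_fraction(150.0, theta_y)
        print(f"theta_Y = {theta_y:5.1f} deg: theta* at f_s = 5% is {angle:6.1f} deg; "
              f"f_s limit for theta* = 150 deg is {100 * limit:4.1f}%")
```

Under these assumptions, a completely wetting liquid (θ_Y = 0°) on a surface with f_s = 5% gives θ* of about 154°, and the 150° threshold corresponds to f_s of about 6.7%, in line with the f_s < 6% criterion quoted in the text.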

From Figs. 1 and 2, one can now reason that 
a structured surface may repel any liquid if the 
microstructures are doubly reentrant and also 
of a low enough solid fraction. However, com- 
mon doubly reentrant shapes in the literature 
(8, 10, 15, 18, 26) produce only a weak suspension 
and a moderate solid fraction insufficient to 
repel highly wetting liquids. To suspend com- 
pletely wetting liquids on a surface with a min- 
imal solid fraction, an entire surface should be 
uniformly covered with doubly reentrant struc- 
tures having vertical overhangs as thin, vertical, 
and short as possible. As illustrated in Fig. 1C, 
such an ideal doubly reentrant structure mini- 
mizes the break-in force by the liquid pressure 
that wets the cavity and maximizes the surface 
tension force that suspends the liquid against 
wetting (eq. S1) (9). The thin and vertical geometry
of the overhangs minimizes their projected
area added to the solid fraction, and the short- 
ness of the overhang keeps the increase of the 
solid fraction by the vertical surfaces at bay. Some 
superhydrophobic or superoleophobic surfaces 
described in the literature attempted to incorpo- 
rate doubly reentrant structures but with little 
success. For example, only a few doubly reen- 
trant structures were formed among predom- 
inantly simple or reentrant structures prone to 
wetting (3, 14, 25), and structures barely sat- 
isfying the doubly reentrant shape were repli- 
cated from springtail skin with only a moderate 
solid fraction (18). 


To fulfill all the requirements reasoned above and quantified from
Fig. 2, we designed a surface illustrated in Fig. 3A: an array of
doubly reentrant structures consisting of microscale posts with
nanoscale vertical overhangs. Posts were chosen over ridges or holes
to minimize f_s more easily. Also, the post array allows the air
underneath the droplet to remain connected to the atmosphere so that
the liquid is suspended only by surface tension and is not assisted by
the pressure of the trapped air. We chose to form the surface
structures from SiO2 for the following two reasons: First, clean SiO2
is highly wetted (i.e., θ_Y < 10°) by most liquids (except liquid
metals such as mercury), including water (1, 20). Because roughening
of a SiO2 surface is supposed to amplify the liquid affinity to
complete wetting (1), structuring a SiO2 surface to repel liquids
should provide a stark contrast to the existing approach. Second, Si
micromachining provides sophisticated equipment and techniques to
process SiO2. With precisely controlled thermal oxidation of a
shallow-etched silicon surface followed by three sequential etching
steps on SiO2 and Si (fig. S3E) (28), we successfully fabricated a
SiO2 surface (1.7 cm by 1.7 cm) with close-to-ideal doubly reentrant
structures (Fig. 3, B to E). The inclined angle of the vertical
overhang is measured to be ~85 ± 1° (Fig. 3E), providing a maximum
suspension force that is 99.6% of the perfectly vertical overhang
shown in Fig. 1C. In spite of the overall resemblance between the
microposts in Fig. 3B and those of superoleophobic surfaces
(13, 15-17), it is the close-to-ideal nanoscale vertical overhangs in
Fig. 3, C to E, that lead to an unprecedented liquid repellency.
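
The 99.6% figure can be reproduced with a one-line estimate, sketched below under a stated assumption: for a completely wetting liquid pinned at the overhang tip, the upward component of the surface tension is taken to scale with the sine of the overhang's inclination from the horizontal. This scaling and the function name are assumptions made for illustration; the full force balance is the one the text refers to as eq. S1.

```python
import math


def suspension_force_fraction(incline_deg):
    """Vertical force of an inclined overhang relative to a perfectly vertical one.

    Assumes the upward component of surface tension scales as sin(incline),
    with the inclination measured from the horizontal plane.
    """
    return math.sin(math.radians(incline_deg))


if __name__ == "__main__":
    # Measured inclination of ~85 degrees with a +/- 1 degree spread.
    for angle in (84.0, 85.0, 86.0):
        fraction = suspension_force_fraction(angle)
        print(f"incline = {angle:.0f} deg -> {100 * fraction:.1f}% of the vertical case")
```

With an inclination of 85°, this estimate gives 99.6%, matching the quoted value; the ± 1° spread changes it by only a few tenths of a percent.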

To evaluate the liquid repellency, we chose 14 different liquids
(table S1) (29), including water, ionic liquid, acid, oils, and
numerous polar or nonpolar organic or fluorinated solvents with
surface tensions ranging from 72.8 mN/m (i.e., water) to the lowest
known value of 10 mN/m (i.e., FC-72). As expected, a smooth SiO2
surface was highly wetted (θ* = θ_Y < 10°) by all of the liquids
(table S2). In contrast, our structured SiO2 surface successfully
suspended and repelled all of the tested liquids, beading them into
Cassie-state droplets (water, methanol, and FC-72 shown in fig. S4A
and movie S1) and letting them roll around (fig. S4B and movies S2 and
S3); that is, behaving superomniphobic in air.

To quantify the repellency of our surface, we measured the advancing
and receding contact angles (Fig. 4A) and roll-off angles (fig. S4C)
of all 14 liquids. For comparison, Fig. 4A also includes the other two
liquid-repellent surfaces analyzed in Fig. 1: a superhydrophobic
surface consisting of cylindrical posts (Fig. 1A and fig. S3B) and a
superoleophobic surface consisting of posts with reentrant overhangs
(Fig. 1B and fig. S3D), both of which were coated with a hydrophobic
layer of C4F8. As expected, although the superhydrophobic surface with
vertical posts could not suspend liquids with surface tension below
~40 mN/m, the superoleophobic surface with reentrant posts repelled
liquids with lower surface tension (20 to 40 mN/m). However, liquids
with even lower surface tension (<20 mN/m) could not be suspended, as
they wicked between the reentrant posts. In contrast, on the surface
with doubly reentrant posts, all 14 liquids formed large contact
angles, even without any hydrophobic coating.

The extent to which wettability is modulated
by surface roughness is shown in Fig. 4B, where
the apparent wettability (cos θ*) is plotted as a



[Fig. 4A plot. Vertical axis: contact angle (ticks from 20 to 160). Horizontal axis: surface tension of liquids γ (mN/m, ticks from 10 to 70). Legend: θ_A and θ_R on doubly reentrant SiO2; θ_A and θ_R on reentrant C4F8; θ_A and θ_R on vertical C4F8. Scale bar in the image panel: 1 mm.]


Fig. 4. Omniphobicity of the structured SiO2 surface. (A) Apparent
advancing (θ*_A) and receding (θ*_R) contact angles of the 14 liquids
measured on three liquid-repellent surfaces: our omniphobic surface
and two control surfaces of the same nominal solid fraction
(f_s ≈ 5%). Data on the omniphobic surface are depicted by blue
circles (solid and hollow); data on the control surfaces with
reentrant and vertical topologies are represented by orange triangles
and green squares, respectively. Each data point is an average of more than 100 measurements. Error bars are omitted here for clarity and are instead shown in fig. S6. (B) Relations
of contact angles on a smooth surface (cos θ_Y) and on a structured surface (cos θ*). The theoretical relations from the Wenzel and Cassie-Baxter models are
plotted as solid black lines. Data near (1, -1) and (1, 1) are shown in the zoomed-in boxes, revealing the difference between our structured SiO2 surface and the
control surfaces, especially when liquids highly wet the material. (C) Robust repellency of the structured SiO2 surface demonstrated by bouncing FC-72 off the
superomniphobic SiO2 surface with doubly reentrant posts of D ≈ 10 μm, P = 50 μm, δ ≈ 920 nm, t ≈ 270 nm, and f_s ≈ 5% under Weber number ~0.42.



[Fig. 4C snapshot times: 147 ms, 181 ms, 191 ms, 225 ms, 263 ms, 764 ms.]


function of the inherent wettability (cos θ_Y). Data from our surface
with doubly reentrant posts (i.e., blue circles) were populated at the
lower right corner in the fourth quadrant near point (1, -1),
exhibiting the exceptional ability to render a highly wettable
material superrepellent. In contrast, although the two control
surfaces permit a Cassie state with nonwettable or partially wettable
material, they got soaked when the material was highly wetted by the
liquids of very low surface tension (i.e., hexane and six fluorinated
solvents), displaying θ* ≈ 0° with data populated near point (1, 1).
These results are consistent with the theory schematically summarized
in fig. S2.

In addition to repelling all 14 liquids (movie S2), our
superomniphobic surfaces are also expected to sustain static (fig. S5
and movie S4) and dynamic (movie S5) pressures better than the
existing superhydrophobic and superoleophobic surfaces (26). The
doubly reentrant structures allow droplets to bounce on even extremely
sparse posts (i.e., tens of micrometers of pitch and a solid fraction
of only ~5%). With high-speed imaging, water, methanol, and FC-72
droplets were confirmed to bounce off the truly superomniphobic SiO2
surfaces (movie S5). Water (γ = 72.8 mN/m) and methanol
(γ = 22.5 mN/m) droplets rebounded on a surface with microposts of
100-μm pitch; this pitch was much larger than those reported in the
literature (3, 15). However, FC-72 (γ = 10 mN/m) droplets penetrated
and wetted the above surface at impact. A surface with uniformly
halved structures (i.e., f_s remaining at ~5%) was further prepared to
provide enough resistance against impalement and let FC-72 droplets
rebound, as shown with snapshots in Fig. 4C. While resisting the
physical intrusion of liquids, this surface has no defense against
some other intrusion mechanisms such as condensation inside the
cavity. The internal condensation would be a common issue to all
existing superhydrophobic and superoleophobic surfaces (7), calling
for a provision (5) for practical use.

Because the proposed superrepellency depends only on physical
attributes, we further fabricated metal (i.e., tungsten) and polymer
(i.e., parylene) counterparts based on the given SiO2 surface and
confirmed that they possess the same superrepellency as expected
(fig. S4A). The ability to repel fluorinated solvents may allow
electronic circuits to be cooled by nucleate boiling (i.e., the most
efficient mode of cooling) (4). Free of polymeric coating, the
superomniphobic SiO2 surface can serve at high temperatures. The
surface was found unaltered after storage at >1000°C and was used to
demonstrate rolling-off of another FC liquid at 150°C and a
nonvolatile liquid at >320°C (fig. S8 and movie S6). The polymer-free
parts are expected to last longer in outdoor environments, where
polymeric materials tend to degrade faster. Unaffected by the surface
chemistry, the superomniphobic SiO2 surface also demonstrated
prolonged repellency to biological fluids (such as the sheep serum we
tested), whereas a regular superhydrophobic surface lost the
repellency (fig. S9 and movie S7).
MINERALOGY

Discovery of bridgmanite, the most 
abundant mineral in Earth, in a 
shocked meteorite 

Oliver Tschauner, Chi Ma, John R. Beckett, Clemens Prescher,
Vitali B. Prakapenka, George R. Rossman

Meteorites exposed to high pressures and temperatures during impact-induced shock
often contain minerals whose occurrence and stability normally confine them to the
deeper portions of Earth's mantle. One exception has been MgSiO3 in the perovskite
structure, which is the most abundant solid phase in Earth. Here we report the discovery
of this important phase as a mineral in the Tenham L6 chondrite, approved by the
International Mineralogical Association (specimen IMA 2014-017). MgSiO3-perovskite is
now called bridgmanite. The associated phase assemblage constrains peak shock
conditions to ~24 gigapascals and 2300 kelvin. The discovery concludes a half century of
efforts to find, identify, and characterize a natural specimen of this important mineral.


In the geosciences, the complexity of compositions and histories of
naturally occurring minerals and rocks provides an important ground
truth against which experiment and theory are measured. One of the
most glaring omissions in the study of Earth’s mantle has been the
inability to find naturally occurring specimens of what we believe to
be Earth’s most abundant rock-forming phase, (Mg,Fe)SiO3 in an
orthorhombic ABO3 perovskite structure. Despite appearing for decades
in numerous experimental and theoretical studies (1-5),
characterizations of possible natural samples have not been sufficient
to meet International Mineralogical Association criteria for naming a
new mineral (6). Consequently, any detailed chemical, structural, and
petrographic analysis of natural (Mg,Fe)SiO3-perovskite has remained
impossible. In addition, having a formal mineral name for a phase that
is so important is important in itself. Various ambiguous or incorrect
terms such as “silicate perovskite” and “perovskite” have been used
for describing this phase, but they convey ambiguity to the
description of research findings. We put this ambiguity to rest by
describing the natural occurrence of bridgmanite: MgSiO3 in the
orthorhombic ABO3 perovskite structure. The name bridgmanite honors
Percy W. Bridgman (1882-1961), the 1946 Nobel laureate in Physics, for
his fundamental contributions to high-pressure mineralogy in
particular, and to high-pressure research in general.

The importance of bridgmanite in the lower 
mantle of Earth has long been recognized: Several 
lines of evidence show that it forms through a 
Fig. 1. Scanning electron microscope image of a
bridgmanite-akimotoite aggregate. The backscatter electron image
reveals an aggregate of submicrometer-sized crystals of bridgmanite
and akimotoite enclosed in (Mg,Fe)SiO3 glass and within a
Tenham shock-melt vein. Majorite is found in the vein matrix.
The bridgmanite-akimotoite clast is a pseudomorph after
pyroxene that was trapped in the melt. This observation is
consistent with an earlier report about the possible occurrence
of bridgmanite with akimotoite in Tenham (16).
breakdown of rock-forming (Mg,Fe)2SiO4 into (Mg,Fe)O periclase and
(Mg,Fe)SiO3 bridgmanite in the lower mantle of Earth below a depth of
660 km (1-5). Bridgmanite remains stable to the D" layer, nearly down
to the core-mantle boundary region at a depth of 2900 km (7). Thus,
bridgmanite makes up about 38 volume % of the entire Earth (4). The
chemical and physical properties of bridgmanite have a large influence
on elemental distribution, mass flow, and heat flow within Earth’s
mantle. Numerous efforts have all failed to find a naturally occurring
example of this elusive mineral for several reasons. Most importantly,
(Mg,Fe)SiO3 in the perovskite structure is stable only at very high
pressures and temperatures (8). The mineral is metastable under
ambient conditions but vitrifies above temperatures as low as 310 K
(9). The exhumation of rocks originating in the lower mantle is far
too slow to permit the preservation of terrestrial bridgmanite,
although inclusions in some diamonds from such rocks have been
interpreted as the breakdown products of bridgmanite (10, 11). Heavily
shocked meteorites provide an alternative route for preserving
bridgmanite. Pressures and temperatures during the shock event can be
high enough to stabilize bridgmanite, and the release to ambient
conditions may be sufficiently fast to kinetically inhibit breakdown
reactions. As a result, several high-pressure phases known to be
stable only deep within Earth’s mantle have been found as minerals in
these meteorites (12-18). These observations instigated decades of
efforts to find and characterize bridgmanite in shocked meteorites
(15-19). Meticulous transmission electron microscopy examination
yielded indications of the presence of bridgmanite in chondritic and
martian meteorites (15-19). However, rapid vitrification in the
electron beam, a lack of adequate sets of reflections for unique
crystallographic indexing, and the absence of quantitative structure
factor moduli rendered these observations insufficient to characterize
a new mineral.

A different approach to the search for bridgmanite lies in using
microfocused high-energy synchrotron x-ray beams instead of electron
beams for diffraction. The intense high-energy x-ray beam does little
to damage bridgmanite because of its low absorbance. Micro-focusing
and novel fast readout area detector techniques permit efficient
mapping of possible host regions in shocked meteorites (20). Our
search focused on shock-melt veins and their inclusions, which were
previously identified as the hosts of other high-pressure silicate
phases (12-20). In particular, we examined the highly shocked Tenham
L6 chondrite and identified bridgmanite in clasts within the
shock-melt veins. We found bridgmanite always associated with
akimotoite but never as isolated crystals in the melt vein. These two
phases along with a vitreous matrix whose composition is identical
within error to that of the bridgmanite (table S1) replace precursor
orthopyroxene crystals trapped within a melt vein (Fig. 1 and
fig. S1). We interpret this assemblage to reflect bridgmanite that
partially vitrified upon release from the shock state on the parent
body or during its residence on Earth after its fall in 1879. We
hypothesize that the volume expansion upon transformation from
bridgmanite or dense glass into normal glass by ~33% (21) and 1 to 2%
(22), respectively, induces stresses in the surrounding rock that help
preserve the remaining bridgmanite.

Bridgmanite assumes the Pnma perovskite structure with unit cell
parameters a = 5.02 ± 0.03 Å, b = 6.90 ± 0.03 Å, and
c = 4.81 ± 0.02 Å, which yield a unit cell volume of 167 ± 2 Å³
(Fig. 2) (23). The uncertainties (± SEM) are from (i) uncertainty in
the Rietveld refinement (see Fig. 2) and (ii) variations in cell
parameters due to varying chemical composition (fig. S2). Akimotoite
and ringwoodite also contribute to the diffraction pattern. The
average composition of the type material (table S1) has a formula unit
of (Mg0.75Fe0.20Na0.03Ca0.02Mn0.01)Si1.00O3 (25). The composition is
well within the range of synthetic bridgmanites, despite being a quite
sodic and Fe-rich composition in comparison (fig. S2).
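
The reported cell volume follows directly from the lattice parameters, because an orthorhombic cell has volume V = a·b·c. The sketch below recomputes it and propagates the quoted uncertainties, assuming the three errors are independent; that independence, and the function names, are assumptions made for illustration rather than the authors' stated procedure.

```python
import math


def orthorhombic_volume(a, b, c):
    """Unit cell volume of an orthorhombic cell (all angles 90 degrees)."""
    return a * b * c


def volume_uncertainty(a, b, c, da, db, dc):
    """First-order error propagation for V = a*b*c, assuming independent errors."""
    relative = math.sqrt((da / a) ** 2 + (db / b) ** 2 + (dc / c) ** 2)
    return orthorhombic_volume(a, b, c) * relative


if __name__ == "__main__":
    # Lattice parameters and uncertainties quoted in the text, in angstroms.
    a, b, c = 5.02, 6.90, 4.81
    da, db, dc = 0.03, 0.03, 0.02
    volume = orthorhombic_volume(a, b, c)
    sigma = volume_uncertainty(a, b, c, da, db, dc)
    print(f"V = {volume:.1f} +/- {sigma:.1f} cubic angstroms")
```

The product is about 166.6 Å³, which rounds to the reported 167 Å³. The simple propagation gives roughly 1.4 Å³, the same order as the quoted ± 2 Å³.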

The cell volume of natural bridgmanite lies on
an extension of the trend of volume expansion
along with increasing Fe3+ content (24), which is
consistent with a large amount of ferric iron as

Fig. 2. Powder diffraction pattern and Rietveld refinement of
bridgmanite. The figure shows the observed diffraction pattern (black
line and symbols) of bridgmanite-bearing shock-melt vein material in
thin section USNM 7703 (23), whole pattern refinement (red), refined
pattern of bridgmanite (blue), residual of fit (green), and positions
of observed reflections of bridgmanite, akimotoite, and ringwoodite
(blue, red, and green tick marks, respectively). The x-ray wavelength
was 0.3344 Å. The weighted-profile refinement factor was 0.08, and
there were 799 observations. The examined portion of the Tenham meteorite revealed
diffraction by bridgmanite, akimotoite, and ringwoodite in relative volume proportions of 11, 16, and 74%.
Ringwoodite forms clasts within the shock-melt vein (fig. S1), some of which occur beneath the examined
akimotoite-bridgmanite-glass clast (the x-ray beam covers a 3 x 4 μm² area but it passes through the
entire ~30-μm-thick rock section). The bridgmanite-akimotoite aggregate has a volume proportion of ~0.7:1,
in accord with scanning electron microscope examination. rel., relative; a.u., arbitrary units.
Fig. 3. Correlation of unit cell volume with Fe content in synthetic
and natural bridgmanite. Bridgmanites with dominantly ferrous iron
exhibit a weak increase in volume with increasing iron content [dashed
curve (24)]. Bridgmanites with large concentrations
variations (fig. S2) as
crystal diffraction studies of the Mg end
compared to synthetic bridgmanites (Fig. 3). As
noted above, the holotype specimen of bridgmanite
also contains high concentrations of Na.
This may extend the stability field of bridgmanite
(23) and supports charge balance for ferric iron
via Na-Fe3+-coupled substitution in holotype
bridgmanites at redox conditions below the
iron-wüstite buffer (26), but plausibly also in
the terrestrial and martian (27) lower mantles.

The evaluation of the shock conditions in 
Tenham beyond the examination of plausible 
recovery paths for bridgmanite is outside the 
scope of this study. The strict association of 
akimotoite and bridgmanite and the likely ab- 
sence of bridgmanite in the matrix of the shock- 
melt vein are pivotal to an assessment (fig. S3). 
They suggest that the peak pressure exceeded 
23 GPa, with temperatures in the melt exceeding 
the solidus at ~2200 K. The absence of bridgmanite
as isolated crystals within the shock-melt
vein suggests that pressures were too low to 
permit crystallization from melt. Using these 
constraints, we estimate the conditions of for- 
mation of bridgmanite in Tenham to be 23 to 
25 GPa and 2200 to 2400 K (fig. S3). This esti- 
mate is consistent with a more recent estimate 
by Xie et al. (20) based on observation of vitrified 
bridgmanite. The occurrence of bridgmanite 
along with conditions of formation of other high- 
pressure minerals imposes strong constraints on 
pressure and temperature conditions during high-level
shock events in meteorites.

CHEMICAL OCEANOGRAPHY

Increasing anthropogenic nitrogen in 
the North Pacific Ocean 


Il-Nam Kim, Kitack Lee,* Nicolas Gruber, David M. Karl, John L. Bullister,
Simon Yang, Tae-Wook Kim

The recent increase in anthropogenic emissions of reactive nitrogen from northeastern 
Asia and the subsequent enhanced deposition over the extensive regions of the North 
Pacific Ocean (NPO) have led to a detectable increase in the nitrate (N) concentration of 
the upper ocean. The rate of increase of excess N relative to phosphate (P) was found 
to be highest (~0.24 micromoles per kilogram per year) in the vicinity of the Asian 
source continent, with rates decreasing eastward across the NPO, consistent with the 
magnitude and distribution of atmospheric nitrogen deposition. This anthropogenically 
driven increase in the N content of the upper NPO may enhance primary production 
in this N-limited region, potentially leading to a long-term change of the NPO from being 
N-limited to P-limited. 


The rate of deposition of reactive nitrogen
(i.e., NOy + NHx and dissolved organic forms;
see supplementary text S1) from the atmosphere
to the open ocean has more than
doubled globally over the past 100 years (1),
reaching a magnitude that is comparable to about
half of the global ocean N2 fixation (2). The increase
in atmospheric nitrogen deposition (AND)
is particularly acute in the North Pacific Ocean 
(NPO) due to rapid population growth and bur- 
geoning industrial activity in northeast Asian 
countries. These changes in northeast Asia have 
markedly increased reactive nitrogen fluxes in 
the adjacent marine environment (3, 4), largely 
through atmospheric transport by westerly winds 
and subsequent deposition. Though it has been
recognized that such an increasing addition of
reactive nitrogen to the ocean could lead to major
changes in the upper ocean nitrogen cycle
and biological productivity (5), the majority of 
studies conducted to date have suggested that 
the impact is small and not detectable (1), except 
in near-shore environments and marginal seas 
(3). A recent study directly comparing nutrient 
measurements made over more than two dec- 
ades showed that nitrate (N) concentration has 
increased in the northeast Asian marginal seas 
and that this increase was probably due to strongly 
growing AND (3). Here we extend this analysis to 
the entire NPO and show that the anthropogenic 
influence has already affected the open-ocean 
nitrogen cycle. 

Owing to a lack of basin-wide nutrient concentration data of sufficient duration
(decades) at strategic locations, we reconstructed the temporal changes in N across
the NPO using a method based on the relation between the excess of N in a water
parcel relative to that expected based on the phosphate (P) concentration and the
chlorofluorocarbon-12 (CFC-12)-derived ventilation age of that water parcel (6)
(supplementary text S2). The excess in N relative to P, i.e., N* (7, 8), at each
sampling location was calculated as $N^* = N - R_{N:P} \times P$, where N and P are
the measured concentrations and $R_{N:P}$ is the Redfield ratio of 16:1. Because of
increasing biases and uncertainties in the CFC-12-derived ages for water masses
>40 years old (8), we limited our analysis to water parcels <40 years old (i.e.,
waters ventilated between the mid-1960s and late-1990s). Although this
single-ventilation-age approach has shortcomings compared with methods that
estimate the full ventilation-age distribution of a water parcel (9), the data used
in this study were collected during a period of quasi-linear growth of CFC-12 in
the atmosphere (fig. S1 and table S1), and the differences between the two types of
tracer-derived ages have little impact on trends examined in the present study (6).
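
As a minimal sketch of the N* definition above, assuming concentrations in micromoles per kilogram, the calculation and the 40-year age cutoff might be written as follows. The function names, the dictionary layout, and the sample values are hypothetical illustrations and are not taken from the study's data or code.

```python
# Redfield N:P ratio stated in the text (16:1).
REDFIELD_N_TO_P = 16.0

# Upper limit on the CFC-12-derived ventilation age used in the text (years).
MAX_VENTILATION_AGE = 40.0


def n_star(nitrate, phosphate, ratio=REDFIELD_N_TO_P):
    """Return N* = N - R_{N:P} * P for concentrations in umol/kg."""
    return nitrate - ratio * phosphate


def is_young_enough(ventilation_age, max_age=MAX_VENTILATION_AGE):
    """Return True for water parcels younger than the age cutoff."""
    return ventilation_age < max_age


# Hypothetical samples (not from the paper): nitrate, phosphate, age in years.
samples = [
    {"nitrate": 10.0, "phosphate": 1.0, "age": 12.0},
    {"nitrate": 25.4, "phosphate": 2.0, "age": 35.0},
    {"nitrate": 40.0, "phosphate": 2.8, "age": 55.0},  # too old, excluded
]

# Keep only young parcels and compute N* for each of them.
selected = [
    n_star(s["nitrate"], s["phosphate"])
    for s in samples
    if is_young_enough(s["age"])
]
```

A negative N* indicates a nitrate deficit relative to what the phosphate concentration would predict at the Redfield ratio.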

Three distinct trends in N* as a function of depth (or ventilation date) were found
in the analysis. Trend 1 shows that if the magnitude of the N sources is similar to
that of the N sinks (N sources ≈ N sinks), the seawater N* values remain
approximately unchanged regardless of the age of the water parcels (see the black
line in Fig. 1). Trend 1 is typically observed in deep waters (>3000 m) but is not
confined to such waters. Trend 1 can also be seen in the upper waters of the Pacific
Ocean sector of the Southern Ocean (solid and dotted lines in the upper inset in
Fig. 1) and the Indian Ocean (dotted line in the lower inset in Fig. 1), which are
regions far removed from major AND sources. Trend 2 shows that if the magnitude of
the N sources is less than that of the N sinks (N sources < N sinks), the values of
N* fall below the black line representing trend 1 (see the purple curve and the
lower inset in Fig. 1). Trend 2 is applicable to intermediate depths at locations
where low O2 concentrations promote denitrification processes (NO3− → N2O/N2),
which remove N. If the magnitude of the N sources in recent decades exceeded that
of the N sinks (N sources > N sinks), as in trend 3, the values of N* should
increase as the age of a water parcel decreases (see the light blue line in Fig. 1).
Trend 3 is usually applicable to the upper ocean, where AND and N2 fixation
overwhelmingly contribute to elevated N* values (10). Thus, to infer the imprint of
increasing AND in the NPO, we focus on data exhibiting trend 3.

As mixing of water masses with different initial values of N* and ventilation age
could bias trend 3, we used the extended optimum multiparameter method
(supplementary text S3) to separate the temporal N* trend from any mixing
contribution. This analysis was performed along density coordinates σθ for the core
density layers of the main water masses in all six regions [i.e., region 1:
σθ = 27.2 to 27.3 for the East Sea Intermediate Water (ESIW), regions 2 to 6:
σθ = 25.5 to 26.5 for North Pacific Central Water (NPCW)] (insets in Fig. 2, C to H;
see also supplementary text S3 and fig. S2). The rate of N* increase from this
isopycnal analysis is in good agreement with the depth-based rates estimated from
linear regression of the relation between N* and the partial pressure of CFC-12
(pCFC-12)-derived ventilation date for data found on trend 3 in all six regions
(Fig. 2, B to H, and Table 1). This consistency indicates that trend 3 is unlikely
to be generated by mixing between low-N* water from low-O2 intermediate depths and
high-N* upper water. It remains possible that this temporal trend is an artifact
arising from the production and remineralization of organic matter occurring at a
ratio that is substantially different from our assumed Redfield ratio of 16:1,
especially in light of recent findings suggesting that organic matter formed in low
latitudes tends to have an N:P ratio substantially above the Redfield ratio (11).
However, the positive slope of the relation we observed [i.e., decreasing N* with
depth (Fig. 2, B to H)] indicated that this biological process is unimportant, as it
would generate the opposite slope in the thermocline. Specifically, the formation
of organic matter in the upper ocean with a high N:P ratio would lower N* values in
surface water (11) but increase N* in thermocline waters where the N-rich organic
matter is remineralized.
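
The depth-based rate estimate described above (a linear regression of N* against ventilation date) can be illustrated with an ordinary least-squares fit. This is only a sketch under stated assumptions: the helper name `linear_trend` is hypothetical, and the input values are synthetic numbers chosen for illustration, not measurements from the study.

```python
import math


def linear_trend(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x.

    Returns (slope, intercept, standard error of the slope).
    """
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need at least three paired observations")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual variance with n - 2 degrees of freedom.
    residual_ss = sum(
        (yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y)
    )
    slope_se = math.sqrt(residual_ss / (n - 2) / sxx)
    return slope, intercept, slope_se


# Synthetic illustration only: ventilation year and N* (umol/kg).
ventilation_year = [1975, 1980, 1985, 1990, 1995]
n_star_values = [-6.0, -5.0, -4.0, -3.0, -2.0]

# The slope is the rate of N* increase in umol/kg per year.
rate, offset, rate_se = linear_trend(ventilation_year, n_star_values)
```

The slope plays the role of the rate of N* increase, and its standard error corresponds to the kind of uncertainty quoted with each rate in Table 1.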

Our mixing-corrected reconstructions of the N* trends between the late 1970s and
mid-1990s show increases at all locations across the NPO (Fig. 2). The rate of N*
increase was highest (~0.24 μmol kg⁻¹ yr⁻¹) in the East Sea (region 1 in Fig. 2A)
and steadily decreased in an eastward direction with increasing distance from the
source continent (Fig. 3). In the East Sea, the N* values for waters older than
30 years (<1970s) remained constant (N* = -6.6 ± 0.5 μmol kg⁻¹), whereas the values
for waters younger than 30 years (>1970s) increased with time (Fig. 2C). In the
mid-latitude NPO (20°N to 40°N), N* values greater than deep water values began to
appear in the upper waters (<600 m) ventilated in the late 1970s (Fig. 2, D to H).

To further confirm that the inferred trends of 
N* in the NPO reflect temporal changes in AND 
and are not mixing artifacts, we analyzed the 
temporal evolution of N* in the western and 
central NPO, where the temporal coverage of the 
data is sufficient to perform such an analysis. 
Using the PACIFICA database, we found that in 
the western and central NPO (upper left inset in 
Fig. 4A; areas of 20°N to 40°N and 130°E to 180°W), 
the subsurface N* (300 to 400 m depth) sub- 
stantially increased in the past decade (Fig. 4A); 
the estimated rates within this depth range were 
comparable to those obtained based on pCFC-12 
age (upper right inset in Fig. 4A). This increase in 
N* occurred without a concurrent change in dis- 
solved O2 (fig. S3A) or P (fig. S3B). Similar trends 
were also observed in the central NPO (i.e., the
area between 20°N and 40°N along the 150°W
line) (fig. S4). Furthermore, the overall rate of N*
increase (0.04 ± 0.02 μmol kg⁻¹ yr⁻¹ for the period
1988-2011 for the 100 to 200 m depth interval) 
observed at Station ALOHA (Hawaii Ocean Time- 
series) (Fig. 4B) was also comparable to the rate 
based on pCFC-12 age, although the time frame 
for the comparison is somewhat different. 

To attribute the observed temporal trends in 
N* to the recent increases in AND, we need to 
exclude alternative explanations, in particular 
the possibility of a recent increase in N2 fixation.
Although we cannot completely exclude this pos- 
sibility, several lines of evidence indicate that 
N2 fixation may not be a major contributor. 
First, an increase in N2 fixation would also lead to a decrease in the upper ocean
P concentration and a concomitant increase in its thermocline concentration, which
we do not observe (figs. S3 and S4). Second, an increase in N2 fixation would be
expected to shift the peak in the rate of increase of N* from near the surface to
greater depths, as N-rich organic matter of the N2 fixers is remineralized in the
thermocline. In contrast, we observed the highest rate of N* increase in the
surface ocean (Fig. 2). Taken together, these findings suggest that N2 fixation
does not add substantially to the long-term increase in N* in the NPO. However,
some contribution of N2 fixation cannot be entirely excluded, especially given that
the N*-based rates of N increase exceed the modeled and observed rates of AND
(Fig. 3). At region 5 near Hawaii, for example, the N*-based rates are two to three
times higher than the observed rates of AND (from dry and wet deposition). In this
region, marine N2 fixation is estimated to contribute to ~50% of the export
production (12, 13).

Fig. 1. Three trends of N* variation. This figure shows the relation between N* and
the pCFC-12-derived ventilation date (which is the year that the CFC-12
concentration in the water sample would have last been in equilibrium with the
atmosphere) for seawater samples obtained in the East Sea (Sea of Japan) as part of
the Circulation Research of the East Asian Marginal Seas II program in 1999. Trends
in the N* variations associated with three types of cases are presented: Trend
line 1 (black solid and dotted lines) indicates a balance of N sources and N sinks,
relative to P. Trend line 2 (purple line) would occur if N sinks > N sources (e.g.,
due to denitrification processes in low-oxygen waters). Trend line 3 (light blue
line) indicates major sources of N in recently ventilated waters (and also
indicates that N increased over time, regardless of changes in P). The color scale
indicates depth. The inset shows the relation between N* and the pCFC-12-derived
ventilation date in the Pacific Ocean sector of the Southern Ocean (50°S to 60°S)
and in the tropical Indian Ocean (5°S to 20°S), two regions far removed from major
AND sources, using the PACIFICA and GLODAP databases, respectively.


Having excluded N2 fixation as a major contributor for most of the NPO, AND remains
the only major process that can explain the recent increases in N concentration in
the NPO. Several lines of evidence strongly support the conclusion. First, the
start of the increase in N* in the mid-latitude NPO around the 1970s coincided with
the onset of a rapid increase in anthropogenic nitrogen emissions from northeast
Asia (14). Second, the spatial pattern and absolute magnitude of the rate of
increase of N* agreed well with the AND rates derived from an atmospheric transport
model (15) and observations (16, 17) (Fig. 3). Furthermore, the meridional
distribution of aerosol N concentrations was also consistent with the distribution
of subsurface N* in the central NPO (i.e., 200 to 300 m along the 160°E and 180°W
lines; see fig. S5). In particular, atmospheric N concentrations measured at Midway
(28.2°N and 177.4°W) in the spring (March to May) showed an increasing trend over
the period 1981-2000 (18). Third, the magnitude of the AND-induced N* increase is
consistent with expectations based on a simulation conducted with an updated
version of the National Center for Atmospheric Research (NCAR) Community Earth
System Model (19) over the Anthropocene (supplementary text S4 and fig. S6), where
AND increased from the pre-industrial era (1850) to modern times (2000). This model
includes all of the key processes influencing the marine nitrogen cycle
(water-column and benthic denitrification, N2 fixation, AND, and river inputs) and
thus is well suited to analyze the consequences of increasing AND on the ocean
nitrogen cycle. Finally, such clear signals of seawater N* increase are seen only
in the NPO and not in comparable mid-latitude Southern Hemisphere sections
(fig. S7), where AND has had little impact over the past decades. These
observations are also compelling evidence that the observed N* signals in the NPO
are due to the effect of AND. Thus, the balance of evidence suggests that the
observed increase in N* in the upper NPO is a consequence of the recent increase in
AND, resulting largely from the dramatic increase in anthropogenic nitrogen
emissions in northeastern Asia.

Table 1. Comparison of rates of N* increase based on isopycnal layer and depth for
six regions. See Fig. 2A for region locations. For isopycnal analysis, σθ = 27.2 to
27.3, representing the core density layer of ESIW, was chosen for region 1; and
σθ = 25.5 to 26.5, representing the core density layer of NPCW, was chosen for
regions 2 to 6 (fig. S2). Depth rates were derived from Fig. 2.

            Methods (μmol kg⁻¹ yr⁻¹)
Regions     Isopycnal        Depth
1           0.24 ± 0.03      0.25 ± 0.01
2           0.11 ± 0.03      0.12 ± 0.01
3           0.11 ± 0.04      0.10 ± 0.03
4           0.04 ± 0.02      0.13 ± 0.02
5           0.03 ± 0.04      0.08 ± 0.01
6           -0.03 ± 0.30     0.03 ± 0.01

[Fig. 2 plot residue. Recoverable labels: inset title "Regions 2-6" with
σθ = 25.5 - 26.5; x axes: ventilation date (1960 to 2000), ΔpCFC-12 age (year), and
depth (m); annotation: slope = 0.08 ± 0.01 μmol kg⁻¹ yr⁻¹.]

Fig. 2. Estimation of the rate of increase in excess N (N*). (A) Map showing the
NPO. The color scale indicates AND in 1993 (15) overlaid with the mean wind vectors
over the period 1990-1999 (25). The numbers 1 to 6 in red indicate the East Sea and
five sections in the NPO (P10, P13, P14, P16, and P17), respectively (table S1),
and locations of data used in this analysis are shown as black circles. (B) Plot of
ΔN* ($\Delta N^* = N^*_{\mathrm{obs}} - \sum_i x_i N^*_i$) versus ΔpCFC-12 age
($\Delta\mathrm{pCFC\text{-}12\ age} = \mathrm{pCFC\text{-}12\ age}_{\mathrm{obs}} - \sum_i x_i\,\mathrm{pCFC\text{-}12\ age}_i$),
corrected for the effect of physical mixing ($x_i$) among different water masses
(i.e., Δ = observed value - estimated value from end-member mixing) (7), within the
range of potential density σθ = 25.5 to 26.5 in regions 2 to 6 from the
mid-latitude NPO. The color scale indicates water depth. (C to H) Plots of seawater
N* values (micromoles per kilogram) versus ventilation date (derived from pCFC-12
ages) for regions 1 to 6. Numerical values for ventilation dates are shown only for
1960-2000 due to large uncertainties in earlier dates (see text). The color scale
indicates dissolved O2 concentration. Green shading indicates the envelopes of the
deep water N* values. The star symbols and error bars indicate the mean of the deep
water N* values and the standard deviation (SD) from the mean, respectively. The
rates of N* increase (N*obs > N*deep) were estimated from the linear regression
analysis (shown as the red lines). The insets show plots of seawater ΔN* versus
ΔpCFC-12 age for the core density layers of the ESIW in region 1 (σθ = 27.2 to
27.3) and the NPCW in regions 2 to 6 (σθ = 25.5 to 26.5). See supplementary text S2
and S3 for the data and isopycnal analysis.

The possible impacts of this anthropogenic perturbation on the open-ocean nitrogen
cycle are numerous. The NPO is generally thought to be N-limited (2, 13);
therefore, the input of nitrogen may increase primary and export production. In the
mid-latitude NPO, the rate of N increase is equivalent to 10 ± 2% (20) of the
export production (~1.5 × 10³ mmol C m⁻² yr⁻¹) (21), which is comparable to the
rate derived from AND modeling (5). Given the likelihood that the magnitude of AND
will continue to increase in the future (22), the mid-latitude NPO could rapidly
switch to having surplus N. Thus, past and future increases in AND have the
potential to alter the phytoplankton community composition and, in the long term,
the structure of the ecosystem. In addition, such anthropogenic nitrogen inputs
into the NPO could also enhance N2O production due to an increase in
remineralization in association with enhanced export production levels and
potentially stimulate denitrification (J, 23). If similar trends are confirmed
across the other major ocean basins, it would constitute another example of a
global-scale alteration of the Earth system.

[Plot residue. Recoverable labels: legend entries "N*-based", "Modeled AND", and
"Observed AND"; x axis: Longitude.]

COGNITIVE PSYCHOLOGY


Forgetting the presidents 

H. L. Roediger III* and K. A. DeSoto 


Two studies examined how U.S. presidents are forgotten. A total of 415 undergraduates 
in 1974, 1991, and 2009 recalled as many presidents as possible and attempted to place 
them in their correct ordinal positions. All showed roughly linear forgetting of the eight or 
nine presidents prior to the president holding office at the time, and recall of presidents 
without respect to ordinal position also showed a regular pattern of forgetting. Similar 
outcomes occurred with 497 adults (ages 18 to 69) tested in 2014. We fit forgetting 
functions to the data to predict when six relatively recent presidents will recede in memory 
to the level of most middle presidents (e.g., we predict that Truman will be forgotten to 
the same extent as McKinley by about 2040). These studies show that forgetting from 
collective memory can be studied empirically, as with forgetting in other forms of memory. 


The name of the president of the United States is known to virtually all adult
Americans. When doctors wish to test the cognitive status of a concussion or stroke
patient, they often ask the patient to identify the current president; a response
of “Ronald Reagan” in 2014, for example, reveals a probable deficit. Once they
leave office, however, presidents recede from the memory of U.S. citizens. For
instance, today presidents such as Fillmore, Pierce, and Arthur are barely
remembered at all, yet at one point in America’s past their names were known by all
U.S. adults, just as the names Obama or Bush are known in 2014.

The purpose of this project was to study how 
presidents are forgotten from collective memory. 
Collective memory, sometimes called historical 
or popular memory, refers to the representation 
of the past shared by a group (1-4). Most studies 
in this tradition focus on how events of historical 
significance are remembered (e.g., the Holocaust, 
the 9/11 attacks), whereas our focus is on histo- 
rical forgetting [see (5)]. 

We can assume that recall of a president is 
100% while the president holds office and begins 
to drop when he leaves office. Our question is: 
What is the rate at which samples of U.S. citizens 
forget the presidents over time? 

Across two studies, we determined the rate at which presidents recede from
collective memory of (i) college students and (ii) a wider sample of Americans
(taken from Amazon Mechanical Turk; MTurk). We measured memory for each president
using both ordinal position recall and free recall criteria. Ordinal position
recall describes whether an individual can place a president in the ordinal
position in which he served (e.g., Lincoln in position 16). Free recall assesses
whether an individual can recall a president’s name at all, regardless of ordinal
position. To measure forgetting, we applied two methods to the resulting sets of
data. In the first method, we examined the decline in recall within each group of
subjects from the current president at the time of testing to the next most recent
and so on (i.e., the recency effect in recall within groups of individuals). In the
second method, we computed forgetting curves for six presidents across three
generations of college students.

In our first study, we tested three generations of college undergraduates in three
widely separated years: 159 subjects in 1974 (6), 106 in 1991 (7), and 150 in 2009.
In each case the students were given a sheet of paper numbered according to the
number of presidents (e.g., numbers 1 through 41 in 1991), with instructions to try
to recall as many presidents as possible and to place them in their correct ordinal
position. Students were told that if they recalled a president but not his ordinal
position, they should guess or simply list that president off to the side of the
page. They were given 5 min for recall, which prior research has shown is
sufficient time to exhaust students’ knowledge (8). Figure 1A shows recall of
presidents as a function of their chronological term in office, when students were
given credit for recalling them only if placed in their correct ordinal position;
Fig. 2A shows free recall of the presidents, giving credit for naming the person
without regard to order.

In laboratory studies in which subjects are 
given a list of words or pictures to remember, a 
serial position function is obtained relating list 
position to probability of recall. This function is 
characterized by two components of interest: 
(i) the primacy effect (enhanced recall of items 
at the beginning of the list), and (ii) the recency 
effect (enhanced recall of items at the end of the 
list), both relative to lower recall of items in the 
middle of the list. The recency effect is of interest 
for our purposes because it shows how items are 
forgotten over time. Namely, the last item in a 
series is almost perfectly recalled, with preceding 
items showing systematically lower recall as a 
function of the amount of time or the number of 
intervening items (presidents in our case), counting
backward from the most recent. See fig. S1
for an example of a serial position function for 
word lists. 

Another characteristic of the serial position function is that a distinctive event
(relative to other events that are similar in character) is well remembered even if
it is in the middle of a list. For example, a picture appearing in the middle of a
list of words is recalled better than the surrounding words [the isolation effect
(9-11)]. This feature occurs in our studies with a natural series rather than a
list.

Figure 1A shows that memory for ordinal position of presidents follows a classic
serial position function. All three groups showed a similar pattern of results: The
first presidents were frequently recalled, with a steep drop until Lincoln, who
showed elevated recall along with the two succeeding presidents (A. Johnson and
Grant). This outcome is similar to the isolation effect in list recall, with
Lincoln (or perhaps the U.S. Civil War) serving as the distinctive event. Recall
then dropped back to a low baseline until the recency effect became noticeable.
Even though the three groups of students were tested over a 35-year time period,
the recency effect was remarkably consistent in that it occurred for the last 9 or
10 presidents in all groups. In addition, recall of presidents before the point of
recency (located in what is called the pre-recency portion of the serial position
curve) did not differ much among the groups; this point was confirmed in the second
study (see below). The data also showed consistent forgetting of the order of
recent presidents over time by different groups of college students. For example,
in 1974 nearly all college students recalled L. Johnson and his ordinal position
(36), but by 1991 the proportion had dropped to 0.52 and by 2009 it had plummeted
to 0.20.

Whether presidents have been forgotten alto- 
gether, not merely with respect to their ordinal 
position, is better answered by plotting the data 
using a free recall criterion. These data, shown in 
Fig. 2A for the three groups of undergraduates, 
appear noisier than the data in Fig. 1A. Nonetheless,
recency effects are still apparent, and what ap- 
pears to be noise may rather reflect regularities 
in the memorability of presidents. For example, 
the dips at Truman, L. Johnson, and Ford occur 
in at least two samples and were confirmed in 
the second study. 

A second method of measuring forgetting, the classic one in experimental
psychology, is to plot the forgetting curve over time (or intervening items). The
shape of this function, whether a logarithmic function or a power function, has
been debated, but the power function provides a somewhat better fit when the two
are directly compared (12). Moreover, these forgetting functions appear to be the
same across many different tasks (12, 13). Our data in Figs. 1A and 2A provide
little evidence of forgetting for presidents prior to and including Coolidge.
Memorability of these presidents seems to have reached an asymptote in U.S.
students’ collective memory. However, for presidents since Coolidge, we can fit a
power function to the free recall data and estimate the rate at which they will be
forgotten. We did this for the six most recent presidents for whom we had three
data points (i.e., they had held office in 1974 or earlier): Truman, Eisenhower,
Kennedy, L. Johnson, Nixon, and Ford. We assumed that these presidents had a 100%
recall probability after leaving office and used the 1974, 1991, and 2009 data to
estimate recall probability in those years. We fit power functions to the four
points for each president. These forgetting functions are shown in Fig. 3, and it
is apparent that Truman, L. Johnson, and Ford are fading fastest from historical
memory in this group, whereas Kennedy has been better retained.
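
The fitting step described above can be sketched as a least-squares fit in log-log space. The exact parameterization used for the published fits is not given in this text, so the form recall = a(1 + t)^(-b), the one-year shift that keeps t = 0 finite under the logarithm, the function name, and the recall values below are all assumptions made for illustration.

```python
import math


def fit_power_function(years_since_office, recall):
    """Fit recall = a * (1 + t) ** (-b) by least squares in log-log space.

    The 1 + t shift is an assumption of this sketch: it lets the point at
    t = 0 (leaving office, recall assumed to be 1.0) enter the fit.
    Returns (a, b).
    """
    xs = [math.log(1.0 + t) for t in years_since_office]
    ys = [math.log(p) for p in recall]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    a = math.exp(mean_y - slope * mean_x)
    b = -slope  # positive b means recall declines with time
    return a, b


# Four points per president: leaving office plus three test years.
# Times are years after 1953; recall values are generated from a known
# curve purely for illustration and are not the study's data.
years = [0, 21, 38, 56]
true_b = 0.3
recall = [(1.0 + t) ** (-true_b) for t in years]

a_hat, b_hat = fit_power_function(years, recall)
```

A larger fitted exponent b corresponds to a president who fades faster from memory; with only four points per president, any such fit is tentative.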

We can use these data to estimate the number 
of years that elapse before presidents fall to the 
baseline levels estimated by the line in Fig. 3 (the 
line represents a mean of 0.26, the average mem- 
orability of pre-recency presidents excluding 
Lincoln, his two successors, and the first seven 
presidents). Of course, with only four data points, 
the projections are tentative. They can be seen in 
Fig. 3, but to give one example, we estimate that 
Truman will be forgotten by three-quarters of college
students (i.e., will reach 0.26 free recall probability)
by 2040, 87 years after leaving office.
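
To illustrate how such a projection follows from a fitted curve, the sketch below solves a power function for the time at which recall reaches the 0.26 baseline. The functional form a(1 + t)^(-b), the function name, and the parameter values are hypothetical; they are not the fitted values behind Fig. 3 and will not reproduce the 2040 estimate.

```python
def years_until_baseline(a, b, baseline=0.26):
    """Solve a * (1 + t) ** (-b) = baseline for t, in years after office.

    The power-function form with a one-year shift is an assumption of
    this sketch, not necessarily the form used for the published fits.
    """
    if a <= 0 or b <= 0 or not 0 < baseline < a:
        raise ValueError("need a > baseline > 0 and b > 0")
    return (a / baseline) ** (1.0 / b) - 1.0


# Hypothetical curve parameters, chosen only to show the calculation.
a_example, b_example = 1.0, 0.3
elapsed = years_until_baseline(a_example, b_example)

# Convert elapsed years into a calendar year for a president who left
# office in 1953 (2040 minus 87 years, as stated in the text).
projected_year = 1953 + elapsed
```

Because the solution depends on the exponent through 1/b, small changes in the fitted exponent shift the projected year considerably, which is one reason projections from four points are tentative.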

The data from the study described above were collected from students of the same
age cohort (18 to 22) at three points in time across 35 years. In our second study,
we used a complementary tactic: We tested adults across varying age ranges in May
2014 on the same task. We tested 116 adults aged 18 to 29 (millennials), 207 adults
aged 30 to 49 (generation X), and 174 adults aged 50 to 69 (baby boomers) (14).
These three generations correspond approximately to the three generations of
college students that participated in the first study (e.g., college students in
1974 are the same group that we call baby boomers in 2014). We sought to determine
whether the serial position curves obtained in our earlier studies would replicate
with this larger sample of adults.

Fig. 1. Ordinal position recall probabilities. Ordinal position recall of
presidents across (A) college students of three different generations (data
collected in three different years) and (B) American adults of three different
generations (data collected in 2014).

The data in Fig. 1B show that the answer is generally yes with strict positional
recall, except that the recency part of the curve shows more overlap. In fact,
recall of presidents from Washington to Coolidge reveals hardly any differences
among the three age bands, and the data are similar to those in Fig. 1A. This lends
support to our supposition from the first study that recall of these presidents has
reached the asymptote of the forgetting curve. The differences emerge in the
recency part of the curve, with older adults recalling more presidents in order
than younger adults, at least from Coolidge through L. Johnson. Free recall results
(Fig. 2B) again replicated the results in Fig. 2A reasonably well. The correlations
between the patterns of data of people of similar age in Fig. 1, A and B, were
r = 0.91, 0.95, and 0.98 for the baby boomer, generation X, and millennial samples,
respectively (i.e., the data collected in 1974 correlated 0.91 with the data
collected for the baby boomer group assessed in 2014). The similar values for the
free recall data in Fig. 2, A and B, were r = 0.93, 0.94, and 0.97. All
correlations were significant (P < 0.001); see tables S1 and S2 for the full
correlation matrices. These strong correlations indicate that serial position
curves in recall of U.S. presidents are remarkably similar and stable across age
and across generations spanning 35 years.
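
The correlations reported above compare two recall curves president by president. A minimal sketch of that comparison, assuming each curve is a list of recall proportions in the same president order, is a Pearson correlation. The function name and the short lists below are hypothetical and are not the study's data.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length (>= 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    var_x = sum((xi - mean_x) ** 2 for xi in x)
    var_y = sum((yi - mean_y) ** 2 for yi in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical recall proportions for the same five presidents in two
# samples (for example, students tested earlier and adults tested later).
recall_earlier_sample = [0.95, 0.60, 0.15, 0.25, 0.85]
recall_later_sample = [0.90, 0.50, 0.20, 0.20, 0.80]

similarity = pearson_r(recall_earlier_sample, recall_later_sample)
```

A value near 1 means the two samples rank and space the presidents similarly, even if overall recall levels differ.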

The results from our two studies show forget- 
ting of presidents in terms of their correct ordi- 
nal position in office (Fig. 1) and simply their 
name and the fact that they served (Fig. 2). The 
recency effect indicates that subjects can retrieve 
presidents relatively well if the presidents held 
office during or just before the subjects’ lifetimes, 
but that recall of correct ordinal position drops 
as one attempts to retrieve presidents whose 
terms were more distant in time. By analogy to 
serial position curves in recall of lists, recent 
presidents appear to exist in a state of heightened 
accessibility, shown by the greater recency effect 
that is akin to a short-term component for other 
memory tasks involving recent events or people 
[according to one theory (15); see also (16)]. That 
is, collective memory has a window on the past 
that recedes, although doubtless through mecha- 
nisms different from those involved in short-term 
recall of lists. 

Another interesting finding in our data is how consistently the
pre-recency presidents are remembered across generations of students in the
first study, as confirmed by the replication in our second study. All
samples also recalled Lincoln and his immediate successors better than most
presidents and also recalled their ordinal positions. Why are these data so
consistent? A possible explanation is that presidents who are viewed as
having been more historically influential are better remembered across
generations. According to one theory (17), forgetting is adaptive and
corresponds to environmental demands for needs of information. By this
view, recall of presidents may be due to their frequency of mention in
popular media, and frequency of mention may be determined by importance. To
investigate, we correlated the recall scores from our second study with
presidential rankings provided by history scholars and others (18). The
correlation between recallability and ranking for presidents from
Washington to Coolidge was high [r(27) = 0.73, P < 0.001]. This finding is
in line with the theory and data in (17).

Another theory argues that position in a series accounts for some of the
variance in presidential recall (especially ordinal position recall),
perhaps explaining why the second president, John Adams, is recalled in
position better than Thomas Jefferson, a figure of greater historical
importance (6, 7). Other researchers have supported this claim (19, 20),
but there are also dissenters (21-23). Clearly Lincoln and his successors
are well remembered because of their association with the American Civil
War and the ending of slavery, but it is notable that many students and
adults also often know that Lincoln was the 16th president. This superior
recall of a salient event in a series resembles the isolation effect in
list recall (9-11).

Fig. 2. Free recall probabilities. Free recall of presidents across (A)
college students of three different generations (data collected in three
different years) and (B) American adults of three different generations
(data collected in 2014).

Fig. 3. Predicted forgetting curves. Forgetting curves for six presidents
were predicted using the power function and data from the three generations
of college students. The points show the original data.

Our results show that memories of famous historical people and events can
be studied objectively [see also (4, 5, 24, 25)]. We find two different
functions that characterize forgetting of the presidents. First, for
individuals, memory for the order of presidents who served in office during
the individual’s lifetime (or a few years before) declines linearly.
Second, forgetting of presidents across generations follows a power
function until an asymptote is reached, in line with data from many other
domains [(12, 13); see (5) for similar results]. The asymptote probably
reflects both the importance and frequency of mention of the particular
president (17). The other notable feature of our results is the highly
consistent recall of presidents by college students from several
universities across a spread of 35 years, as well as by other Americans
recruited from MTurk in 2014. The high correlations point to a great
stability in how the presidents are remembered across generations, a
seemingly permanent form of collective memory [i.e., semantic memory (26)].
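To make the second function concrete, here is a minimal sketch of a power-function forgetting curve that levels off at an asymptote. It assumes the simple form recall = a * t^(-b), fitted by least squares on log-log axes, with a floor at an assumed asymptote; the exact functional form and fitting procedure used for Fig. 3 are not given in this text. The data points are synthetic, and the names `fit_power_function` and `predicted_recall` are hypothetical.

```python
import math


def fit_power_function(times, recalls):
    """Fit recall = a * t**(-b) by ordinary least squares on log-log axes.

    Returns (a, b). All times and recalls must be positive.
    """
    xs = [math.log(t) for t in times]
    ys = [math.log(r) for r in recalls]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    a = math.exp(mean_y - slope * mean_x)
    return a, -slope  # b is the negated log-log slope


def predicted_recall(t, a, b, asymptote):
    """Power-function decline that stops falling at an assumed asymptote."""
    return max(asymptote, a * t ** (-b))


# Synthetic example: points generated exactly from a = 2.0, b = 0.5.
years_since_term = [4, 16, 64]
recall_probability = [1.0, 0.5, 0.25]

a, b = fit_power_function(years_since_term, recall_probability)
# Far in the past, the prediction is held at the assumed asymptote of 0.2.
long_run = predicted_recall(400, a, b, asymptote=0.2)
```

With only three generations of data per president, a fit like this has very few points, so any such curve should be read as a rough projection.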
MITOCHONDRIA 

Cell cycle-dependent regulation of 
mitochondrial preprotein translocase 

Mitochondria play central roles in cellular energy conversion, metabolism, and apoptosis. 
Mitochondria import more than 1000 different proteins from the cytosol. It is unknown 
if the mitochondrial protein import machinery is connected to the cell division cycle. 
We found that the cyclin-dependent kinase Cdk1 stimulated assembly of the main 
mitochondrial entry gate, the translocase of the outer membrane (TOM), in mitosis. 
The molecular mechanism involved phosphorylation of the cytosolic precursor of Tom6 
by cyclin Clb3-activated Cdk1, leading to enhanced import of Tom6 into mitochondria. 
Tom6 phosphorylation promoted assembly of the protein import channel Tom40 and 
import of fusion proteins, thus stimulating the respiratory activity of mitochondria in 
mitosis. Tom6 phosphorylation provides a direct means for regulating mitochondrial 
biogenesis and activity in a cell cycle-specific manner. 


Mitochondria are crucial for numerous tasks, from adenosine
5'-triphosphate synthesis and metabolism of amino acids, lipids, iron, and
heme to apoptosis (1-8). Mitochondria cannot be formed de novo, but can be
formed only by growth and division of preexisting organelles. Growth of
mitochondria depends on the import of a large number of cytosolically
synthesized precursor proteins. Multiple pathways of protein import into
mitochondria have been identified (2, 3, 9). Nearly all pathways use the
main mitochondrial entry gate, the translocase of the outer membrane (TOM).
Mitochondria form a dynamic network that is continuously remodeled by
fusion and fission events that, together with cytoskeleton-dependent
transport and anchoring in daughter and mother cells, ensure a proper
distribution of mitochondria during cell division (5, 7, 10-13). Cell
cycle-dependent regulation of mitochondrial components has been observed in
a few cases, including activation of the fission protein Drp1, of
respiratory complex I, and of a mitochondrial DNA binding protein; the
fusion protein Fzo1 is degraded upon arrest of yeast cells in G1 phase
(14-19). Whereas mitochondrial protein import is regulated by cytosolic
kinases under different metabolic conditions (respiratory versus
nonrespiratory) (20, 21), whether protein import and the cell cycle are
connected is not clear.

We used budding yeast as a model organism to study the mitochondrial
protein import machinery in different phases of the cell cycle. Spellman
et al. (22) performed a genome-wide screen for mRNA levels during the yeast
cell cycle. We analyzed the screen for components of the mitochondrial
protein import machinery and noted a moderate increase of TOM6 mRNA levels,
but not of other translocase components, in G2/M phase (fig. S1A). Tom6 is
a small integral membrane protein involved in the assembly of the TOM
complex (23-25). To analyze if the change of mRNA levels resulted in
altered Tom6 protein levels in different phases of the cell cycle, yeast
cells were arrested in G1 phase by the pheromone α-factor, in S phase by
the DNA synthesis inhibitor hydroxyurea, and in M phase by the spindle
poison nocodazole (26); cdc14-1 yeast mutants were also arrested in M phase
(late anaphase) (27, 28). The protein levels of Tom6 were increased in M
phase (Fig. 1A; fig. S1, B to D; and table S1).

We asked if the large increase in mitochondrial Tom6 levels in M phase
involved a direct regulatory mechanism in addition to the moderate increase
in mRNA levels. Ser16 of yeast Tom6 is a potential target site for
cyclin-dependent kinases (CDKs) (20). Tom6 consists of an N-terminal
hydrophilic domain and a C-terminal membrane anchor. A fusion protein
containing the N-terminal domain was incubated with yeast cytosolic extract
or purified Cdk1 (Cdc28), the CDK essential for cell cycle progression in
yeast (29). Both cytosolic extract and purified Cdk1 (but not the CDK
Pho85) phosphorylated Tom6; addition of the CDK inhibitor NU6102 or
replacement of Ser16 by alanine blocked phosphorylation (Fig. 1B and
fig. S2) (for the receptors Tom20, Tom22, and Tom70, Cdk1- or cell
cycle-dependent phosphorylation was not observed; fig. S3). The full-length
precursor of Tom6 was phosphorylated with Cdk1 and incubated with isolated
mitochondria. Phosphorylated Tom6 was efficiently targeted to mitochondria
and membrane integrated; in the absence of Cdk1 or upon replacement of
Ser16 by alanine, import of Tom6 into mitochondria was strongly inhibited
(Fig. 1, C and D). The stability of imported Tom6 was not significantly
altered by phosphorylation (fig. S4). Thus, Cdk1-mediated phosphorylation
of the Tom6 precursor promotes its efficient import into mitochondria.

The mitochondrial levels of Tom40, the central channel-forming subunit of
the TOM complex, were also increased in M phase (Fig. 1A, fig. S1D, and
table S1). To determine if phosphorylation of Tom6 caused an alteration of
the TOM complex, we generated yeast strains in which Ser16 of Tom6 was
replaced either by alanine or phospho-mimetic glutamate. The levels of the
TOM complex were decreased in Tom6-S16A mitochondria, yet increased in
Tom6-S16E mitochondria (Fig. 2A). Tom6-S16E mitochondria contained
increased levels of Tom40 (Fig. 2B). Tom6 is required for the stability of
the TOM complex and thus in tom6Δ mitochondria, the amount of mature TOM
complex is diminished (Fig. 2A) (23, 25). We asked if Tom6 was
rate-limiting for Tom40 assembly and generated a yeast strain
overexpressing Tom6 (fig. S5A). Radiolabeled Tom40 precursor was incubated
with isolated mitochondria, and the assembly stages were analyzed (24, 25).
Overexpression of Tom6 stimulated the assembly of Tom40, but not of other
β-barrel proteins like porin that are not assembled into TOM; the use of
Tom6-S16E mitochondria also stimulated Tom40 assembly, whereas Tom6-S16A
mitochondria were impaired in its assembly (Fig. 2, C to E, and fig. S5B).
Thus, the Cdk1-triggered increase in mitochondrial Tom6 levels stimulates
assembly of the mitochondrial protein import channel Tom40.

Yeast contains nine cyclins that activate Cdk1 in distinct cell cycle
phases (G1, S, or G2/M) (29, 30). To synchronize growth of yeast cells,
they were arrested in G1 by α-factor, followed by removal of α-factor (26).
We purified Cdk1 from different cell cycle phases under mild conditions (to
maintain association with cyclins) and probed phosphorylation of the Tom6
N-terminal fusion protein. Cdk1 purified from G2/M phase was most efficient
in phosphorylation of Tom6 (Fig. 3A). To identify the responsible cyclin,
we expressed and purified individual cyclins and combined them with Cdk1.
The G2/M cyclin Clb3 activated Cdk1 for phosphorylation of Tom6-Ser16
(Fig. 3B), whereas Cdk1 from clb3-deficient yeast was strongly inhibited in
Tom6 phosphorylation (Fig. 3C) [as control, neither Pho85 nor Ssn3 (Cdk8)
phosphorylated Tom6]. Thus, Clb3 activates Cdk1 to phosphorylate Tom6 in
G2/M phase. The assembly of Tom40 occurred with the highest efficiency in
yeast cells arrested in M phase (Fig. 3D), in agreement with our model that
Tom6 phosphorylation by Cdk1 in G2/M phase promotes Tom40 biogenesis.

The levels of the two mitochondrial fusion guanosine triphosphatases
(GTPases), outer-membrane Fzo1 and in particular inner-membrane Mgm1
(10-12), were increased in M phase (Fig. 1A, fig. S1D, and table S1).
Overexpression of Tom6 stimulated Mgm1 import into mitochondria (Fig. 4A).
The hydrophilic cytosol-exposed domain of Tom6 bound the precursor of Mgm1,
but not the model preprotein b2(167)Δ-dihydrofolate reductase (DHFR);
binding of Mgm1 depended on the phosphorylation of Tom6 by Cdk1 and was
blocked by removal of the N-terminal segment of the Mgm1 presequence
(Fig. 4B). Thus, phosphorylated Tom6 specifically recognized the Mgm1
precursor. Mitochondrial import of Fzo1 was increased upon overexpression
of Tom6 as well as after arrest in M phase (Fig. 4C and fig. S6A). Because
the Fzo1 precursor did not bind to the cytosol-exposed domain of Tom6
(Fig. 4B), the transmembrane segment of Tom6 may be involved in Fzo1
import. We imported the Fzo1 precursor and shortened constructs into
mitochondria containing tagged full-length Tom6. Copurification of Fzo1
with Tom6 (Fig. 4D) depended on a region containing the first transmembrane
segment of Fzo1 and was enhanced by the additional presence of its
intermembrane space loop (fig. S6B). Whereas the import of control
preproteins was not affected by Tom6 or by cell cycle phases (fig. S7), the
increase in Fzo1 and Mgm1 levels in M phase was blocked in Tom6-S16A
mutants (Fig. 4E). The stability of Fzo1 and Mgm1 was not altered in M
phase (fig. S8) [Fzo1 levels are decreased upon G1 arrest (14)]. Neither
the mitochondrial levels of Fis1 nor recruitment of the fission GTPase Dnm1
(10-12) to mitochondria were altered by Tom6 phosphorylation (Fig. 4E and
fig. S9). Thus, phosphorylation of Tom6 stimulates TOM assembly and
mitochondrial import of Fzo1 and Mgm1 in G2/M phase.

To characterize the consequences on cell growth, wild-type and Tom6-S16A
mutants were synchronized by α-factor arrest and their growth compared. The
initial growth curves of both strains were similar up to M phase. Then, the
growth of Tom6-S16A mutants lagged considerably behind that of the wild
type, and progression of the cell cycle was delayed (Fig. 4F and fig. S10).
Oxygen consumption was increased in M phase, yet not in Tom6-S16A mutants
(Fig. 4G and fig. S11). The mitochondrial morphology was comparable between
wild-type and Tom6-S16A mutant yeast (fig. S12; unless the mitochondrial
morphology was compromised by an additional mutation, fig. S13). Thus,
phosphorylation of Tom6 and its effect on the levels of fusion GTPases
primarily affected the bioenergetic activity of mitochondria. Mgm1/OPA1 is
required for fusion and for the maintenance of cristae and thus can affect
the energetic capacity of the oxidative phosphorylation machinery
(7, 12, 31, 32). Overexpression of Mgm1 indeed enhanced oxygen consumption
(fig. S11). Thus, phosphorylation of Tom6 in G2/M stimulates the import of
fusion GTPases, in particular of Mgm1, and enhances the bioenergetic
activity of mitochondria.

Fig. 2. Tom6-Ser16 phosphorylation enhances Tom40 biogenesis. (A) TOM
complex of Tom6 mutant mitochondria analyzed by blue native electrophoresis
of digitonin-lysed mitochondria and immunoblotting. (B) Steady-state levels
of mitochondrial proteins in Tom6 mutant mitochondria. (C) β-barrel
assembly in mitochondria from WT and a Tom6-overexpressing strain using
[35S]Tom40 or [35S]Porin precursor, analyzed by blue native
electrophoresis. Tom40 assembly includes intermediate I (binding to sorting
and assembly machinery SAM), intermediate II, and formation of mature TOM
(35). (D) Assembly of [35S]Tom40 precursor in mitochondria of Tom6
phosphomutants. (E) Assembly of Tom40 and Porin as in (C) and (D)
(mean ± SEM, N ≥ 3; WT controls set to 100%).

The cell division cycle and import of proteins into mitochondria are
essential for cell propagation and growth (1-3, 13, 29, 33). Here we have
found a direct molecular connection between cell cycle control by Cdk1 and
assembly of the main protein entry gate of mitochondria. Phosphorylation of
the mitochondrial assembly factor Tom6 by Cdk1-Clb3 initiates a reaction
cascade of enhanced Tom6 targeting and assembly of the protein import
channel Tom40. Increased TOM6 mRNA levels in G2/M underscore the importance
of the assembly factor in this phase of the cell cycle (fig. S1A) (22).
Screening of the PHOSIDA database revealed a conserved phosphorylated
M-phase peptide of human Tom6 (34), indicating that phosphorylation of Tom6
has been conserved in yeast and humans (fig. S14). Notably, out of many
mitochondrial proteins tested, the levels of assembled TOM complex and of
membrane fusion GTPases are increased in G2/M. Phosphorylation of Tom6
stimulates the mitochondrial import of Mgm1, leading to an increased
respiratory activity of mitochondria. The increased supply of bioenergy can
promote energy-dependent steps of cytokinesis (18). This direct link
between cell cycle and mitochondrial protein import suggests a specific
role for the import of fusion proteins for mitochondrial activity and
efficient progression through the cell division cycle.
The origin of contemporary Europeans remains contentious. We obtained a genome sequence 
from Kostenki 14 in European Russia dating from 38,700 to 36,200 years ago, one of the 
oldest fossils of anatomically modern humans from Europe. We find that Kostenki 14 shares 
a close ancestry with the 24,000-year-old Mal'ta boy from central Siberia, European Mesolithic 
hunter-gatherers, some contemporary western Siberians, and many Europeans, but not eastern 
Asians. Additionally, the Kostenki 14 genome shows evidence of shared ancestry with a 
population basal to all Eurasians that also relates to later European Neolithic farmers. We 
find that Kostenki 14 contains more Neandertal DNA that is contained in longer tracts than 
present Europeans. Our findings reveal the timing of divergence of western Eurasians and East 
Asians to be more than 36,200 years ago and that European genomic structure today dates 
back to the Upper Paleolithic and derives from a metapopulation that at times stretched from 
Europe to central Asia. 


The ancestors of contemporary Eurasians are believed to have left Africa
some 60,000 to 50,000 years ago (60 to 50 ka) (1, 2), possibly 30,000 to
40,000 years later than Australo-Melanesian ancestors (3). Despite
controversies about routes out of Africa, the first Upper Paleolithic (UP)
industries of Eurasia are found in the Levant from ~48 ka (4, 5). Expansion
into Europe took place through multiple events that by ~40 ka had generated
a spatially and culturally structured anatomically modern human (AMH)
population, from Russia (6) to Georgia (7), Bulgaria (8), southern Europe
(9, 10), and the United Kingdom (11). The few AMH fossils associated with
these initial UP industries are morphologically variable (9, 12-17). In
western Eurasia, the distinctive Aurignacian toolkit, first observed at
Willendorf (Austria) by 43.5 ka (18), became predominant across the earlier
range by 39 ka. Although analyses of ancient human genomes have advanced
our understanding of the European past, revealing contributions from
Paleolithic Siberians, European Mesolithic, and Near Eastern Neolithic
groups to the European gene pool (19-23), the possible contribution of the
earliest Eurasians to these later cultures and to contemporary human
populations remains unknown. To investigate this, we sequenced the genome
of Kostenki 14 (K14, Markina Gora) (Fig. 1A).

The locality of Kostenki-Borshchevo on the Middle Don River, Russia, has
one of the most extensive Paleolithic records in eastern Europe. The K14
human skeleton was excavated in 1954 (24) and was recently dated to
33,250 ± 500 radiocarbon years before the present (B.P.) (25), 38.7 to 36.2
thousand calendar years B.P. (ky cal B.P.), in agreement with the
stratigraphic position of the burial that cuts into the Campanian
Ignimbrite ash layer dated to ~39.3 ky cal B.P. (26). Below the skeleton,
there is a distinctive early UP industry, with end scrapers, burins,
prismatic cores, and bone artifacts (layer IV); the cultural layer above
(layer III) has a regionally local character (27, 28) [supplementary
materials (SM) S1 and S2].

We performed 13 DNA extractions from a total of 1.285 g of the left tibia
(dorsal side of the shaft), using two extraction methods based on silica
purification (29, 30). We first constructed seven Illumina libraries and
validated the presence of typical signatures of postmortem DNA damage,
using a fraction of DNA extracts (SM S3). The remaining extracts were built
into 63 libraries after enzymatic uracil-specific excision reagent (USER)
treatment to limit the effect of nucleotide misincorporations in downstream
analyses (31) (table S2). Additionally, a limited fraction of two DNA
extracts was purified for methylated DNA fragments using methyl binding
domain (MBD) enrichment (32) before USER treatment and library building,
for a total of eight DNA libraries. Following stringent quality criteria
for read alignment, we identified a total number of 175.2 million unique
reads aligning against the human reference genome hg19, representing an
average depth of coverage of 2.84X (SM S4). The eight USER-treated DNA
libraries that exhibited limited error rates and contamination levels were
selected for further analyses. This restricted the data set to 148.9
million unique reads, representing a final depth of coverage of 2.42X. We
exploited the fact that K14 was a male and used the heterozygosity levels
present in the X chromosome to estimate overall levels of contamination
around 2.0% (SM S5 and S6 and table S5). The population genetics analyses
results are robust to contamination of that level. In particular, we
replicated the main analyses with selected libraries with varying
contamination levels and observed no qualitative effect on the results (see
SM S9 for details).

¹Centre for GeoGenetics, Natural History Museum of Denmark, University of
Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark. ²Department of
Zoology, University of Cambridge, Downing Street, Cambridge, CB2 3EJ, UK.
³Department of Human Genetics, University of Chicago, 920 East 58th Street,
Cummings Life Science Center, Chicago, IL 60637, USA. ⁴The Bioinformatics
Center, University of Copenhagen, Ole Maaløes Vej 5, 2200 København N,
Denmark. ⁵Department of Integrative Biology, University of California,
Berkeley, CA 94720, USA. ⁶Department of Physical Anthropology, Kunstkamera,
Peter the Great Museum of Anthropology and Ethnography, Russian Academy of
Sciences, 24 Srednii Prospect, Vassilievskii Island, St. Petersburg,
Russia. ⁷Center for the Study of the First Americans and Department of
Anthropology, Texas A&M University, TAMU-4352, College Station, Texas
77845-4352, USA. ⁸Environmental Futures Research Institute, Griffith
University, 170 Kessels Road, Nathan, Brisbane, Queensland 4111, Australia.
⁹Department of Epidemiology and Biostatistics, University of California San
Francisco, 185 Berry Street, Lobby 5, Suite 5700, San Francisco, CA 94107,
USA. ¹⁰Division of Archaeology, University of Cambridge, Cambridge, Downing
Street, CB2 3DZ, UK. ¹¹Department of Human Evolution, Max-Planck-Institute
for Evolutionary Anthropology, Leipzig, Deutscher Platz 6, D-04103,
Germany. ¹²Leverhulme Centre for Human Evolutionary Studies, Department of
Archaeology and Anthropology, University of Cambridge, Cambridge,
Fitzwilliam Street, CB2 1QH, UK.

*These authors contributed equally to the work. †Corresponding author.
E-mail: ewillerslev@snm.ku.dk (E.W.), rasmus_nielsen@berkeley.edu (R.N.),
mbmll@cam.ac.uk (M.M.L.)

Mitochondrial analyses confirmed the sequence previously reported for K14
[haplogroup U2 (33)], which supports data authenticity. The Y chromosome
belongs to haplogroup C-M130, the same as in La Braña, a late Mesolithic
hunter-gatherer (MHG) from northern Spain (22) (SM S7).

To identify patterns of shared ancestry and admixture among K14, other
ancient genomes, and contemporary Eurasians [based on a single-nucleotide
polymorphism (SNP) array panel of 2091 individuals from 167 populations],
we carried out a series of analyses: model-based clustering and principal
component analysis (PCA) to show the contribution of diverse genetic
components within K14; D statistics to explore the affinity of K14 to pairs
of populations (using Mbuti Pygmy as an outgroup); f4 statistics to test
whether a given modern population is equidistant to an ancient individual
and a particular recent group (here, Sardinians), given an outgroup (here,
Papuans); and f3 statistics to explore both patterns of admixture
(“admixture f3”) and shared ancestry (“outgroup f3”). Key results were also
replicated using two whole-genome sequencing data sets of modern
individuals from worldwide populations (23, 34).

Model-based clustering analyses (35) show that K14 has different genetic
components of substantial size (Fig. 1B and SM S10), suggesting the sharing
of sets of alleles with different Eurasian groups. The largest fraction of
K14’s ancestry derives from a component that is maximized in European MHGs
and also predominant in contemporary northern and eastern Europeans. The
genetic affinity of K14 to contemporary Europeans is also observed using
outgroup f3 statistics (36). Using Mbuti Pygmy as an outgroup, we find that
among a panel of 167 contemporary populations, Europeans have the greatest
affinity (i.e., the largest f3) to K14 (Fig. 1C). This conclusion is also
formally supported by comparing pairs of populations to K14 using the D
statistics of the form D(Mbuti Pygmy, K14; Population 1, Population 2).
This statistic is expected to be equal to zero if K14 is symmetrically
related to Population 1 and Population 2, whereas its expectation is
negative (positive) if K14 is more closely related to Population 1
(Population 2). For pairs of populations involving East Asians
(Population 1) and Europeans (Population 2), K14 is always significantly
more closely related to Europeans [e.g., Z = 12.1 (Han and Lithuanians)],
in all data sets analyzed (SM S9 and table S7). We also confirmed that
these results are robust to possible contamination from a modern DNA source
by filtering for reads with a high likelihood of ancient DNA using a
model-based approach (37), as well as calculating contamination-corrected D
statistics (23) (SM S9 and fig. S18).
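The sign convention described for D(Mbuti Pygmy, K14; Population 1, Population 2) can be illustrated with a minimal sketch. It uses one commonly used allele-frequency form of the D statistic, which is an assumption here: the exact estimator, SNP filtering, and the jackknife procedure behind the reported Z scores are described in the supplementary materials and are not reproduced. The allele frequencies are toy values, and `d_statistic` is a hypothetical function name.

```python
def d_statistic(w, x, y, z):
    """D(W, X; Y, Z) from per-SNP allele frequencies of four populations.

    Numerator:   sum over SNPs of (w - x) * (y - z)
    Denominator: sum over SNPs of (w + x - 2wx) * (y + z - 2yz)
    """
    if not (len(w) == len(x) == len(y) == len(z)):
        raise ValueError("all frequency lists must have the same length")
    num = sum((wi - xi) * (yi - zi) for wi, xi, yi, zi in zip(w, x, y, z))
    den = sum((wi + xi - 2 * wi * xi) * (yi + zi - 2 * yi * zi)
              for wi, xi, yi, zi in zip(w, x, y, z))
    if den == 0:
        raise ValueError("no informative sites")
    return num / den


# Toy data: four SNPs, frequency of the derived allele in each population.
outgroup = [0, 0, 0, 0]   # stands in for Mbuti Pygmy
ancient = [1, 1, 1, 1]    # stands in for K14
pop1 = [0, 0, 1, 0]       # shares the ancient allele at one site
pop2 = [1, 1, 0, 1]       # shares the ancient allele at three sites

d = d_statistic(outgroup, ancient, pop1, pop2)
```

In this toy case d is positive because the ancient sample shares more alleles with Population 2, and swapping the last two arguments flips the sign, matching the convention stated in the text.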

Within Europe, northern Europeans show the closest affinity to K14, based
both on the f3 (Fig. 1D) and D statistics [e.g., Z = 6.7 for Sardinians and
Lithuanians; table S7 and fig. S16]. This pattern closely resembles that of
European MHGs (La Braña, Ajv58, Loschbour, and Motala) and Mal'ta (MA1)
(figs. S14 and S15), with the exception of the latter’s strong genetic
affinity with Native Americans, which is unique to that individual.
Furthermore, a direct comparison to ancient genomes in the outgroup f3
statistics shows that K14 has a higher affinity with MHGs (Loschbour and La
Braña) than any other ancient individual or contemporary population
(fig. S14). Together with the rare Y chromosome lineage shared with La
Braña, these results provide strong evidence of shared ancestry and
extensive gene flow between UP West Eurasian people related to K14 and
European MHGs and their contemporary European descendants.

An interpretation of the above results would be that K14 is an early
member of a lineage leading to western Eurasian MHGs after their split from
the proposed ancestral northern Eurasian lineage, including MA1. However, D
statistics of the form D(Mbuti Pygmy, Modern; Ancient, K14), which test
whether K14 and an ancient individual form a clade with respect to a modern
population, reject this simple tree-like relationship. We find that all
contemporary non-Africans, except Australo-Melanesians, are closer to
either MA1 or MHGs than to K14 [e.g., Z = -5.3 for D(Mbuti and Han;
Loschbour and K14); SM S9, table S10, and fig. S19]. This would suggest a
basal position of K14 with respect to MHGs and ancient north Eurasians,
which is also shown in admixture graphs using TreeMix (SM S12 and figs. S24
and S25). In addition, a sizeable component of K14’s ancestry observed in
the model-based clustering analyses is predominant in contemporary Middle
Eastern/Caucasus (ME/C) populations and Neolithic ancient genomes (NEOL)
(Gok2, Iceman, and Stuttgart) but absent in MA1 or MHGs (Fig. 1B and
fig. S20). This component has been associated with a suggested “basal
Eurasian” lineage contributing to NEOL to explain an observed increase in
allele sharing between MHGs/MA1 and East Asians compared with NEOL (21).
Because K14 shows the same pattern as NEOL, a parsimonious explanation
would be that K14 also derives some ancestry from a related basal Eurasian
lineage. Consistent with this hypothesis, we find that East Asians are
equally distant to NEOL and K14, using D statistics as described above
[e.g., Z = 0.0 for D(Mbuti, Han; Stuttgart, K14); tables S10 and S11]. This
suggests that the main ancestral components proposed for contemporary
Europeans, including the Middle Eastern component commonly attributed to
the expansion of early farmers within Europe, were likely already
genetically differentiated and related through complex gene flow by the
time of K14, at least 36.2 ka (Fig. 2).

We further investigated the relationship of K14 and the other ancient
genomes to East Asian and Siberian populations using f4 statistics
f4(Sardinian, Ancient; Modern, Papuan), which measure whether a modern
population shares more alleles with contemporary Europeans or an ancient
genome. We find that all Siberian and East Asians are equally distant from
western MHGs (all |Z| < 1.9) (Fig. 3D and table S12), supporting the
postulated early split between East Asians and western Eurasians. In
contrast to MHGs and MA1, all Siberian populations are genetically closer
to contemporary Europeans (Sardinians) than to K14 (3.1 < |Z| < 9.9)
(table S12), particularly those from the Yenisei and Ob’ basins (e.g.,
Shors, Z = 8.0) (Fig. 3A). Furthermore, these populations derive parts of
their ancestry from a European hunter-gatherer (HG) component inferred in
the clustering analysis (Fig. 1D and fig. S20), with populations showing a
higher HG ancestry proportion also being closer to contemporary Europeans,
using the f4 statistic (Spearman ρ = 0.96; P = 3.0 × 10““) (Fig. 3D and
table S13). Notably, the opposite pattern is observed with Scandinavian
MHGs (Ajv58 and Motala), where the same populations tend to share more
alleles with MHGs than contemporary Europeans and the HG component is
negatively correlated with f4 (e.g., Motala ρ = -0.85; P = 6.2 × 10”“)
(Fig. 3, C and D). Calculating admixture f3 statistics, we find significant
evidence for admixture in those populations, with a variety of Siberian and
European source populations. The best pair of source populations (i.e., the
most negative f3 statistic) involves Swedish MHGs (Motala) and Evens (a
northeast Siberian population) [e.g., f3(Shors; Evens, Motala) = -0.012;
Z = -9.1] (table S14). Altogether, these results suggest that contemporary
Siberian populations from the Yenisei basin derive part of their gene pool
from a Eurasian HG population that shares ancestry with K14 but is more
closely related to Scandinavian MHGs than to either MA1 or western European
MHGs, indicating gene flow between their ancestors and Scandinavian Europe
after K14 but before the Mesolithic (between 36.3 and 7 ky B.P.).
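The two uses of the three-population statistic mentioned in this report (admixture and outgroup) can be sketched with a single function. This is a simplified, uncorrected form, f3(C; A, B) as the mean over SNPs of (c - a)(c - b); it omits the finite-sample correction and the standard errors that a real analysis would need, so it should be read as an assumption-laden illustration. The frequencies are toy values and `f3` is a hypothetical function name.

```python
def f3(c, a, b):
    """Uncorrected f3(C; A, B): mean over SNPs of (c - a) * (c - b).

    c, a, b are per-SNP allele frequencies in populations C, A, and B.
    """
    if not (len(c) == len(a) == len(b)) or not c:
        raise ValueError("need three non-empty lists of equal length")
    return sum((ci - ai) * (ci - bi) for ci, ai, bi in zip(c, a, b)) / len(c)


# Admixture test: a target whose frequencies lie between two differentiated
# sources gives a negative value.
source_a = [0.0, 1.0, 0.0, 1.0]
source_b = [1.0, 0.0, 1.0, 0.0]
target = [0.5, 0.5, 0.5, 0.5]
admixture_f3 = f3(target, source_a, source_b)

# Outgroup form: with an outgroup in the first position, a larger value
# means populations A and B share more drift since splitting from it.
outgroup = [0.0, 0.0, 0.0, 0.0]
pop_a = [1.0, 1.0, 0.0, 0.0]
pop_b = [1.0, 0.0, 0.0, 0.0]
outgroup_f3 = f3(outgroup, pop_a, pop_b)
```

Here admixture_f3 is negative, the pattern reported for f3(Shors; Evens, Motala), while outgroup_f3 is positive because the two populations share a derived allele at one site.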
Fig. 1. Sampling locations and genomic affinities of K14 and other ancient genomes. (A) Location of Kostenki and the samples analyzed in this study. 
Kostenki (K14) is shown in red, while comparative ancient samples are shown in blue. (B) Admixture proportions for the ancient genomes, assuming nine 
ancestral components for a clustering analysis in a set of modern worldwide populations. We labeled the components according to the modern populations in 
which they are maximized for all but one case: The yellow component that we label HG is maximized in eastern Europeans. NEOL: Neolithic farmers. (C) Shared 
drift between K14 and a set of worldwide populations. For every modern population X on the map, we compute f3(Mbuti Pygmy; K14, X). The warmer colors 
indicate increased shared ancestry. (D) Shared drift between K14 and a set of European populations. This figure is a close-up of (C). 
Fig. 2. Relationships of the K14 sample and MA1, MHG, NEOL, 
modern Europeans, and the modern populations in the Yenisei 
region. This representation is a possible topology consistent with 
the results presented in this study in the context of the relationships 
described by Lazaridis et al. (21) for the modern European 
populations and Raghavan et al. (23) for MA1. Present-day populations 
are colored in blue, ancient populations in red, and ancestral 
populations in green. Solid lines represent descent without 
admixture events, and dashed lines show admixture events. Arrows 
do not depart from ancient samples (K14 and MA1) because 
they represent relationships of population ancestry. We only show 
the topology of the potential population tree: There is no notion of 
time in this representation. The tree is not the result of a model-fitting 
procedure but rather a possible topology consistent with the 
key results (A, B, and C) of this study. 
Fig. 3. Relationships of K14 and other HG genomes with contemporary East Asian and Siberian populations. (A) Values of the D statistic for a set of 
Finally, we estimated levels of Neandertal ancestry
in K14 using f4-ratio statistics (38). Our estimates
are consistent with previous analyses (34)
showing a Neandertal contribution lower than
2% for most individuals (Fig. 4A). However, both
La Braña and K14 show slightly elevated levels,
with an estimated 2.4 ± 0.4% in K14 (tables S15
and S16). Restricting this analysis to genomic regions
without evidence for Neandertal introgressed
haplotypes in contemporary humans (38, 39) results
in 0% estimated ancestry for most individuals
except K14, where 0.9 ± 0.4% Neandertal
ancestry is still detected (tables S17 and S18). The
difference between K14 and modern genomes could
be caused by several factors, including sampling
effects and genetic drift, natural selection as argued
in (38, 39), or the effects of additional
Neandertal admixture not represented in the modern
gene pool. We next compared the size distribution
of genomic tracts of archaic hominin
origin in K14 and other ancient individuals (Fig.
4B) by identifying genomic regions with high frequencies
of archaic alleles at sites where all modern
Africans carry the ancestral allele. Neandertal
tracts were longer in K14 than in
other ancient individuals, with the longest tract
totaling ~3 Mb on chromosome 6 (Fig. 4C). This
is consistent with K14 being closer to the time of
the admixture event with Neandertals, and carrying
longer archaic tracts that have been affected
by less recombination, than in the other ~11,000-
to 30,000-year-old younger ancient genomes. We
then used the length distribution of shared ancestry
to estimate the admixture time of Neandertals
and humans based on the K14 sample
and obtained an estimate of ~54,000 years (S15).
We note that genomic data from a 45,000-year-old
modern human from Siberia, which was published
during the review process of this study,
also shows longer segments of Neandertal ancestry,
further supporting our conclusions (40).
Because of the divergent position of the K14 sample,
we also examined whether it contained any
fragments of introgressed DNA from other previously
unsampled hominins. However, the distribution
of tracts of divergent DNA provides no
evidence for additional divergent introgressed
DNA (S14). 
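
The link between tract length and admixture time can be sketched under a deliberately simple model. This is an illustration only, not the procedure of section S15. It assumes a single admixture pulse, introgressed tract lengths that are exponentially distributed with mean 1/g Morgans after g generations, and perfect tract detection. The function names, the 29-year generation time, and the toy tract lengths are all hypothetical.

```python
def generations_since_admixture(tract_lengths_morgans):
    """Estimate g as 1 / mean tract length (lengths in Morgans).

    Under the single-pulse model assumed here, tract lengths are
    exponential with rate g, so the maximum-likelihood estimate of g
    is the reciprocal of the sample mean.
    """
    if not tract_lengths_morgans:
        raise ValueError("need at least one tract length")
    mean_length = sum(tract_lengths_morgans) / len(tract_lengths_morgans)
    return 1.0 / mean_length


def admixture_date_years(tract_lengths_morgans, sample_age_years,
                         years_per_generation=29.0):
    """Admixture date in years before present for an ancient sample.

    The tracts only record recombination up to the death of the
    individual, so the sample's own age is added to the elapsed time.
    """
    g = generations_since_admixture(tract_lengths_morgans)
    return sample_age_years + g * years_per_generation


# Toy input (hypothetical): four tracts of 0.002 Morgans in a
# 36,200-year-old individual.
toy_tracts = [0.002, 0.002, 0.002, 0.002]
print(round(admixture_date_years(toy_tracts, sample_age_years=36200)))
```

The sketch makes the qualitative argument concrete: the older the sample, the fewer generations of recombination have shortened its archaic tracts, so longer tracts imply an admixture event closer to the individual's lifetime.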

Several studies have reported on the basal ge- 
netic distinctiveness between western Eurasian 
and eastern Asian populations, as well as between 
all Eurasians and Australo-Melanesians (41-43). 
Our results show no close genetic relationship
between K14 and Australo-Melanesians and support
earlier studies that suggest Australo-Melanesians
derive part of their ancestry from an early population
divergence that predates the separation of
Europeans and East Asians (3). The K14 genome
shows that this early UP individual was clearly
part of a western Eurasian lineage that had already
diverged from eastern Asians, thus establishing
a minimum date for that separation of at
least 36.2 ka. The fact that the limited genomic
information on the ~40 ka Tianyuan modern human
from China clusters with contemporary East Asian
populations (44) suggests an even earlier date. 

Our results further suggest that the early stages 
of the western Eurasian lineage were already com- 
plex (see also Fig. 2). Besides its core affinities with 
Fig. 4. Neandertal admixture in K14 and other ancient genomes. (A) Neandertal admixture proportions for the modern and ancient individuals from 
Eurasia. (B) Ancestry tract length distribution for tracts identified as Neandertal through a sliding-window approach. The sites are ascertained to be ancestral in 
the African populations. For each non-African, the tracts are identified as the regions where sites are derived in Neandertal and the individual shown in X. (C) 
The longest Neandertal haplotype identified in K14 through a sliding-window approach. Individuals were clustered using hierarchical clustering on the genotype 
matrix for the region. Missing data are shown in white; gray indicates homozygous ancestral, blue heterozygote-derived, and black homozygous-derived. 
subsequent European groups, K14 also shares 
alleles with European Neolithic farmers and 
contemporary people from the Middle East/Caucasus, 
which are not found in MA1 and western
European MHGs, indicating genetic exchange 
between K14 and a Basal Eurasian Lineage (which 
eventually contributed to Neolithic groups) after 
the ancestors of MA1 and subsequent European 
MHGs had diverged. This implies that early AMH 
populations became structured early in their his- 
tory, but in the UP already contained the major 
genetic components found in Europeans today. 
As such, our findings show the existence of a 
metapopulation structure in Europe from the 
Upper Paleolithic onward, remnants of which 
are still found today despite migrations to and 
from Europe since the UP. The early UP contribution
is greater among northern than southern 
Europeans, in agreement with the southeast to 
west and north gene flow cline resulting from the 
expansion of Neolithic farmers 9 to 6 ky cal B.P. 
(20, 43). However, descendants of the early UP 
population represented by K14 likely also con- 
tributed genes to western Siberian groups living 
around the mouth of the Yenisei River. Therefore, 
our findings support the view that these Uralic- 
speaking populations represent an ancient admix- 
ture between European and East Asian lineages. 
The recently proposed Holocene gene flow from 
East Asians into northern Europeans (21) can, in 
our view, be equally well explained by population 
structure of the hunter-gatherer metapopulation 
within Europe. As such, our results paint an in- 
creasingly complex picture of colonization histo- 
ry of Europe from the UP to today. Instead of 
inferring a few discrete migration events from Asia 
into Europe, we now see evidence that humans in 
Western Eurasia formed a large metapopulation 
with gene flow in multiple directions occurring 
repeatedly and perhaps continuously. 
TETANUS TOXIN ENTRY 


Nidogens are therapeutic targets 
for the prevention of tetanus 

Kinga Bercsenyi, Nathalie Schmieg, J. Barney Bryson, Martin Wallace, 
Paola Caccin, Matthew Golding, Giuseppe Zanotti, Linda Greensmith, 
Roswitha Nischt, Giampietro Schiavo 

Tetanus neurotoxin (TeNT) is among the most poisonous substances on Earth and a 
major cause of neonatal death in nonvaccinated areas. TeNT targets the neuromuscular 
junction (NMJ) with high affinity, yet the nature of the TeNT receptor complex remains 
unknown. Here, we show that the presence of nidogens (also known as entactins) at the 
NMJ is the main determinant for TeNT binding. Inhibition of the TeNT-nidogen interaction 
by using small nidogen-derived peptides or genetic ablation of nidogens prevented the 
binding of TeNT to neurons and protected mice from TeNT-induced spastic paralysis. 

Our findings demonstrate the direct involvement of an extracellular matrix protein as a 
receptor for TeNT at the NMJ, paving the way for the development of therapeutics for the 
prevention of tetanus by targeting this protein-protein interaction. 


Tetanus neurotoxin (TeNT) is composed of 
two subunits, which perform specific functions
necessary for targeting this toxin to 
the central nervous system (CNS) and for its 
high potency. The heavy (H) chain mediates
high-affinity binding and entry into neurons,
whereas the light (L) chain causes synaptic 
silencing of inhibitory interneurons, which normally
suppress motor neuron activity, and thereby 
induces spastic paralysis (1). After internalization
into motor neurons, TeNT and the carboxyl-terminal
fragment of its H chain (HcT) are sorted 
to signaling endosomes, which undergo axonal 
retrograde transport toward the motor neuron
cell body (2). We have previously characterized
the proteome of these organelles using 
HcT-conjugated magnetic nanoparticles (2). This 
affinity purification approach identified a list of 
proteins associated with signaling endosomes, 
some of which were predicted to be involved in 
the binding and/or internalization of HcT. We 
analyzed the sequences of these proteins for the 
presence of the tripeptide Tyr-Glu-Trp (YEW), 
which was previously shown to interact with the 
sialic acid binding site (R site) of HcT, a region 
implicated in polysialoganglioside and acidic 
lipid binding (3, 4). The corresponding region 
of botulinum neurotoxins (BoNTs) interacts 
with the synaptic proteins SV2A-C or synaptotagmin
I/II (5-11), raising the possibility that the 
R site of TeNT also binds to a protein receptor. 

To identify putative TeNT receptors, we selected
candidates containing variants of the YEW 
peptide, where this motif is present within the 
lumen of signaling endosomes, a localization 
topologically equivalent to the extracellular domain
of these proteins when localized to the 
plasma membrane (table S1). A total of 35 nine-residue
peptides containing the YEW motif or 
closely related sequences were tested for binding
to HcT by using direct fluorescent binding, 
enzyme-linked immunosorbent assay (ELISA), 
and peptide pull-down assays (Fig. 1A and fig. 
S1, A and B). The interacting peptides were then 
assessed for their ability to compete for HcT 


Fig. 1. Nidogen peptides bind to HcT and prevent its interaction with full-length
nidogens in vitro. (A) Assay for peptides that bind fluorescent HcT. 
The percentage of maximal binding was plotted for each peptide. The results 
were tested for significance between individual peptides and Tris-buffered 
saline (TBS) control (n = 3 independent experiments; *P < 0.05; error bar, 
SD). (B) Pull-down of HA-HcT preincubated with full-length recombinant 
nidogen-1 (top) or nidogen-2 (bottom). “Empty beads” corresponds to 
samples in which HA-HcT was omitted, and “DMSO” (dimethyl sulfoxide) 
to samples treated with vehicle control. Both nidogen-1 and -2 bind directly 
to HA-HcT, and pretreatment with the N1 or N2 peptide blocks this 
interaction. In contrast, mutant peptides (N1AA and N2AA) are ineffective in 
preventing this interaction (n = 3 independent experiments for both 
proteins). The bands detected in the empty bead controls are due to the 
heavy chain of the capture antibody, which comigrates with HcT. (C and E) 
Vehicle control (HcT alone), N1, or N1AA peptides were (C) preincubated, (E) 
added together (0 min), or added after a 10-min delay (10 min) to primary mouse 
motor neurons treated with AlexaFluor555-HcT in the absence or the presence of 
exogenous nidogen-1 (C) before fixing, immunostaining for βIII tubulin, and 
imaging. Scale bar, (C) 50 μm; (E) 10 μm. (D and F) Quantification of the data 
shown in (C) and (E). Twenty images (D) or three-tile scans of 4 by 4 fields (F) 
were taken for each condition, and fluorescence intensity of AlexaFluor555-HcT 
bound to neurons as defined by a βIII tubulin mask was quantified. Results were 
tested for statistical significance (n = 3 independent experiments; NS, 
nonsignificant; **P < 0.01; ***P < 0.005; ****P < 0.001; error bar, SD). 
binding on primary motor neurons (fig. S1C). 
Peptides N1 and N2, which correspond to short 
sequences in nidogen-1 and nidogen-2, respectively
(table S1 and fig. S1D), were selected for 
further investigation because they significantly 
reduced HcT binding to neurons (Fig. 1, C and D, 
and fig. S1C). The YEW-like motif of nidogen-1 
(YQW) and nidogen-2 (WSY) and their flanking 
residues were responsible for this interaction 
because single alanine mutants showed reduced 
binding to HcT (fig. S1, E and F), and the double 
alanine mutant (N1AA) (table S1) failed to block 
the binding of HcT to motor neurons (Fig. 1, C 
and D). Furthermore, when the N1 peptide, but 
not N1AA, was added to motor neurons, either 
together with HcT or 10 min afterwards, it still 
significantly inhibited HcT binding to these neurons
(Fig. 1, E and F). 

We then tested whether HcT interacts directly 
with full-length nidogens by incubating immobilized
hemagglutinin (HA)-tagged HcT with full-length
recombinant nidogen-1 or nidogen-2 and 
demonstrated that both proteins directly bind 
to HcT (Fig. 1B). The N1 and N2 peptide sequences
of nidogen-1 and -2 played a key role 
in this interaction because preincubating HcT 
with an excess of these peptides, but not their 
respective mutants (table S1), abolished this binding
(Fig. 1B). 

The two mammalian nidogens play partially 
overlapping roles in basement membrane (BM) 
formation and maintenance (12-14). Although 
nidogen-2 knockout (KO) mice do not display 
any overt phenotype, animals lacking nidogen-1 
develop a progressive hindlimb paralysis, indicating
a role for this protein in motor neuron survival
(15). Nidogens are tightly integrated within 
the BM, which renders them difficult to access 
for exogenous ligands. To determine whether the 
amount of cell surface-associated nidogens is the 
limiting factor for HcT binding to motor neurons, 
we added soluble full-length nidogen-1 together 
with HcT to cultured motor neurons. This resulted
in the binding of exogenous nidogen-1 to 
motor neurons (fig. S2) and augmented the amount 
of cell surface-bound HcT (Fig. 1, C and D), suggesting
that the addition of exogenous nidogen-1 
increased the number of HcT binding sites on 
the neuronal surface. This effect was dependent 
on the YEW-like motif of nidogen-1 because preincubation
of HcT with the N1 but not the N1AA 
peptide prevented the increased cell surface 
binding of HcT (Fig. 1, C and D). Coimmunoprecipitation
of HcT and endogenous nidogen-2 
from motor neurons incubated with HcT for 
5 min at 37°C (fig. S3) suggested that the interaction
between nidogens and HcT is preserved 
upon entry of HcT into endosomal compartments. 

Inspection of the domain structure of nidogens 
revealed that the N1 and N2 peptides reside 
within the globular G2 domain (fig. S4A and 
movie S1). Molecular modeling (fig. S4, B to D) 
revealed that the N1 peptide, both in isolation 
and within the intact G2 domain, is able to 
dock to the R site of HcT, where sialic acid, 
disialyllactose, and the YEW motif bind (4). The 
molecular surface concealed by the N1 peptide-HcT
complex constitutes more than 40% of the 
total peptide surface (fig. S4C), suggesting that 
this interaction, which is driven by the potential 
formation of five hydrogen bonds and other polar 
interactions (table S2), is very strong. 

The G2 domain of nidogen-1 fits into a large 
crevice at the top of the trefoil domain of HcT 
(fig. S4B and movie S1), concealing 1114 and 
1106 Å² of the molecular surface of HcT and the 
G2 domain of nidogen-1, respectively. Complex 
formation is likely to be promoted by complementary
electrostatic surfaces and several potential
hydrogen bonds (table S3 and fig. S4D). To 
confirm the validity of this structural model, 




Fig. 2. The nidogen-2 content of the neuromuscular 
junction correlates with HcT binding and uptake. 
(A and B) The anterior part of the lower hindlimb of adult 
mice was injected with HA-HcT. After 30 min, animals 
were transcardially perfused; the muscles were dissected, 
post-fixed, sectioned, and stained for HA-HcT, choline
acetyltransferase (ChAT), or nidogen-2 and for AlexaFluor555-α-bungarotoxin
(α-BTx, in gray) to label NMJs. HcT (red) 
binds to the presynaptic site of the NMJ, where ChAT 
(green) is localized (A). Bright HcT puncta are found in 
regions where nidogen-2 (green) accumulates (B). Scale 
bar, 10 μm. (C) HcT and nidogen-2 fluorescence intensities 
were quantified in 50 NMJs. There is a significant correlation
between the integrated signal intensity of both probes 
(P < 0.001, Spearman coefficient 0.923). (D) Primary motor neurons were incubated with AlexaFluor555-HcT and a rabbit antibody to nidogen-2 (for 45 min at 
37°C), acid-washed, fixed, and stained with an antibody to rabbit immunoglobulin G. HcT and nidogen-2 colocalization was quantified, and the corresponding 
Manders' coefficient is shown. Scale bar, 10 μm (n = 3 independent experiments). 
we mutated several residues of the R site of 
HcT that are predicted to play an important 
role in the formation of this complex. As shown 
in fig. S5, we found that mutations of R1226, 
T1146, and Y1229, which form potential H-bonds 
with nidogen-1 (tables S2 and S3, respectively), 
as well as of other R site residues (P1212 and 
G1215) (16), reduced the binding of both N1 and 
N2 peptides. Similarly, alanine scanning of the 
YEW-like motif and flanking residues in the N1 
and N2 peptides strongly decreased their interaction
with HcT (fig. S1, D to F), which strongly
supported the validity of the model shown in 
fig. S4. This mode of interaction is likely to be 
conserved through evolution because the human 
N1 and N2 peptides show a high degree of homology
to the corresponding mouse sequences 
and also bound HcT (fig. S1D). 

Crucially, this model predicts that the associ- 
ation between HcT and nidogen-1 is unlikely to 
interfere with the interaction between HcT and 
polysialogangliosides, which is required for TeNT 
binding to the neuronal surface (3, 17). Thus, 
nidogens and polysialogangliosides have the 
potential to bind TeNT simultaneously and there- 
by fulfill the requirements for the proposed dual 
receptor model (18, 19), which predicts the concurrent
interaction of lipid and protein receptors 
with TeNT and BoNTs for their high-affinity bind- 
ing to neuronal membranes. 

Nidogen-2 is highly enriched at the NMJ (20), 
and we sought to investigate whether it is localized
to the specialized presynaptic BM of this 
structure. We thus injected the lower hindlimb 
of wild-type mice with HcT and then immunostained
cryosections of the extensor digitorum 
longus (EDL) muscle for choline acetyltransferase 
(ChAT), a motor neuron marker, and nidogen-2. 
This analysis revealed that HcT binds to the 
presynaptic motor neuron nerve terminal, which 
is in close apposition to the postsynaptic acetylcholine
receptor, labeled by α-bungarotoxin (α-BTx) 
(Fig. 2A). Increased HcT binding was detected in 
areas where nidogen-2 was most abundant (Fig. 
2B and movie S2) and was absent from NMJs that 
did not express nidogen-2, which revealed a strong 
correlation between the nidogen-2 content and 
HcT binding at the NMJ (Fig. 2C). We then assessed
whether these two proteins are internalized 
and transported together in signaling endosomes. 
HcT and an antibody against nidogen-2 were 
added to primary motor neurons at 37°C and 
allowed to internalize. After a mild acid wash 
to remove all probes still bound to the neuronal 
surface, the internalized α-nidogen-2 was detected
by using a fluorescently tagged secondary 
antibody. The extensive colocalization between 
α-nidogen-2 and HcT throughout the axonal 
network and cell bodies (66%) strongly indicated 
that these two proteins were internalized together
and cotransported by the same organelles 
(Fig. 2D). 
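
The colocalization percentage quoted above is reported as a Manders' coefficient (Fig. 2D). A minimal sketch of the standard M1 definition follows: the fraction of total channel-1 intensity found in pixels where channel 2 is above a threshold. The function name, the flat-list image representation, the threshold, and the toy pixel values are hypothetical; the imaging software and thresholding used in the study are not described in this section.

```python
def manders_m1(channel1, channel2, threshold2=0.0):
    """Manders' M1: share of channel-1 signal located in pixels
    where channel 2 exceeds threshold2.

    channel1 and channel2 are flat sequences of pixel intensities
    from the same image, in the same pixel order.
    """
    if len(channel1) != len(channel2):
        raise ValueError("channels must have the same number of pixels")
    total = 0.0
    colocalized = 0.0
    for value1, value2 in zip(channel1, channel2):
        total += value1
        if value2 > threshold2:
            colocalized += value1
    if total == 0:
        raise ValueError("channel 1 contains no signal")
    return colocalized / total


# Toy five-pixel image (hypothetical values).
hct_signal = [10, 0, 30, 20, 40]
nidogen2_signal = [5, 7, 0, 9, 0]

# Pixels 0 and 3 carry both signals: (10 + 20) / 100 of the HcT signal.
print(manders_m1(hct_signal, nidogen2_signal))
```

Unlike a correlation coefficient, M1 is insensitive to the relative brightness of the two channels, which makes it suitable for asking what fraction of one probe resides in compartments that also contain the other.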

To further test the importance of nidogens 
in TeNT intoxication, we determined the susceptibility
of nidogen KO mice to TeNT-induced 
paralysis and assessed the ability of HcT to bind 
neurons and tissues derived from these mice. Com- 
pared with wild-type motor neurons, nidogen-2 
KO motor neurons exhibited a significantly re- 
duced ability to bind HcT in vitro (Fig. 3, A and 
B). This was further verified in vivo by demon- 
strating that NMJ uptake of HcT was severely 
impaired in levator auris longus (LAL) muscles 
isolated from both nidogen-1 and nidogen-2 
KO mice (Fig. 3, C and D). In stark contrast, the 
capacity of the corresponding binding frag- 
ment of BoNT/A (HcA) to bind to nidogen null 
NMJ was largely unaffected (fig. S6A). Crucially, 
the addition of full-length recombinant nidogen-1 
restored the ability of HcT to bind NMJs lack- 
ing nidogen-1 (fig. S6B). This rescue was de- 
termined by the recruitment of recombinant 
nidogen-1 to mutant NMJs (fig. S6B), suggest- 
ing a replenishment of HcT acceptor sites at 
the synaptic cleft. 

In the genetic background used in our experi- 
ment, nidogen-1 and -2 double knockout (DKO) 
mice show early embryonic lethality (21), precluding
the derivation of motor neuron cultures. 
To overcome this limitation, we assayed HcT up- 
take into isolated hindbrains of DKO embryos 
taken at embryonic day 11.5 (E11.5). In contrast to 
control hindbrains, which displayed robust HcT 
staining, hindbrains from DKO littermates were 
unable to bind HcT (Fig. 3E), further indicating 
that nidogen-1 and -2 are essential for HcT bind- 
ing in vivo (Fig. 3F). 

Because the N1 peptide inhibited HcT binding 
to NMJs in whole LAL muscle preparations (Fig. 
4A), we investigated the ability of the N1 peptide 
to prevent uptake of TeNT in vivo, using three 
independent assays. First, full-length TeNT was 
injected locally into the triceps surae muscle 
either alone or in combination with the N1 or 
N1AA peptides, and the effect on spastic paralysis 
was assessed in vivo by means of footprint anal- 
ysis 24 hours after injection (Fig. 4, B and C). Ad- 
ministration of TeNT alone caused severe gait 




Fig. 3. Neurons lacking nidogens display reduced binding to HcT. (A and B) 
Motor neurons derived from E13.5 wild-type and nidogen-2 knockout (KO) 
mouse embryos were incubated with HA-HcT (gray). Cells were fixed and 
stained for HA-HcT, and fluorescence intensity was quantified in 14 images
by using a βIII tubulin mask (green). Scale bar, 20 μm. The value of 
nontreated cells was subtracted, and data were analyzed [(B), n = 3 
experiments; shown is a representative experiment; ****P < 0.001; error 
bar, SD]. (C) LAL muscles of wild-type, and nidogen-1 and -2 KO animals were 
incubated with HA-HcT, fixed, and stained for HA (gray) and α-BTx (green) to 
identify NMJs. Scale bar, 5 μm. (D) Both nidogen-1 and -2 KO NMJs show a 
significant decrease in their ability to bind and internalize HcT (n = 3 animals 
per group, 20 images per animal; P < 0.001; error bar, SD). (E) Nidogen-1 
and -2 DKO and wild-type E11.5 embryonic hindbrains were tested for HcT 
binding and uptake. Scale bar, 1 mm. (F) Mean intensity of HcT staining was 
measured and analyzed for significance (n = 3 animals per group; *P < 0.05; 
error bar, SD). 
abnormalities, with the characteristic inability 
to place the affected hindpaw in the former po- 
sition of the ipsilateral forepaw, as occurs in nor- 
mal mice (Fig. 4, B and C). At a later stage, TeNT 
caused a permanent plantar-flexion of the af- 
fected hindpaw (movie S3). Coadministration of 
the N1 peptide completely abolished these TeNT- 
induced gait abnormalities (movie S3), whereas 
the N1AA peptide was ineffective (Fig. 4, B and C). 

To corroborate this finding, we quantified the 
protective effect of the N1 peptide by co-injecting 
it with TeNT and measuring the maximal force 
generated by the tibialis anterior (TA) muscle 
using in vivo isometric muscle tension physiol- 
ogy. The contractile force of the TeNT-injected 
muscle was diminished (3.11 ± 0.88% of non- 
injected control) (Fig. 4D and fig. S7A), as was 
the case for muscles co-injected with TeNT and 
the N1AA peptide (2.14 ± 1.44% of noninjected 
control) (Fig. 4D and fig. S7B). In contrast, coadministration
of the N1 peptide significantly 
abrogated the TeNT-mediated decline in contractile
force compared with that of the other two 
groups (47.46 ± 10.31% of noninjected control) 
(Fig. 4D and fig. S7C). Similar results were obtained
by using phrenic nerve-hemidiaphragm 
preparations (fig. S7D) (22). Together, these results
confirmed the ability of the N1 peptide to 
prevent symptoms of tetanus in vivo. 

In order to assess the susceptibility of nidogen-2 
KO mice to systemic TeNT toxicity in vivo, we 
injected wild-type and nidogen-2 KO mice intraperitoneally
with different doses of TeNT and 
monitored the appearance of tetanus. Nidogen-2 
KO animals showed a significant delay in the 
onset and progression of tetanic symptoms at 
lower TeNT doses (Fig. 4E). Unfortunately, the 
possibility to extend these assays to mice lacking
nidogen-1 was precluded by the severe epileptic
and paralytic phenotype affecting these 
animals (15). 

Some of the symptoms displayed by nidogen-2 
KO mice injected with TeNT, such as keratoconjunctivitis
sicca (23), resembled those of botulism.
This suggests that in the absence of one 
of its physiological receptors, a proportion of 
TeNT may exploit alternative routes of entry at 
the NMJ, which are shared with BoNTs. Cross- 
talk between the two modes of entry and intra- 
cellular transport pathways taken by TeNT and 
BoNTs has been previously suggested (24) and 
is further supported by the botulism-like symp- 
toms shown by mice injected with high doses 
of TeNT (25). 

Because BoNTs use several SV2 isoforms as 
receptors (9-11, 22) and SV2A has been pro- 
posed as a TeNT receptor in central neurons (26), 
we quantified the colocalization of HcT with 
SV2A and SV2C in motor neurons and tested 
whether preincubation of HcT with recombi- 
nant nidogen-1 modifies the association of HcT 
with these synaptic vesicle proteins. In the absence
of exogenous nidogen, SV2A showed robust
colocalization with HcT internalized for 
45 min at 37°C (46%) (fig. S8, A and B), whereas 
the codistribution of HcT with SV2C under these 
conditions was relatively low (fig. S8, C and D). 
Therefore, at least a subpool of TeNT may be 
internalized via synaptic vesicle recycling at the 
relatively high concentrations used in this assay. 
However, the addition of recombinant nidogen-1 
caused a significant reduction in the colocalization
of HcT with SV2A (22%) (fig. S8, A and 
B) but not with SV2C (fig. S8, C and D). Thus, 
although recombinant nidogen-1 increased 
the intracellular pool of HcT, the 
proportion of this probe colocalizing with SV2A 
was significantly reduced (fig. S8, A and B). Taken 
together, these data suggest that in the presence 
of nidogen-1, HcT is preferentially targeted to 
an endocytic pathway linked to axonal retrograde
transport to the cell body and is not taken 
up by synaptic vesicle recycling. 

We have identified the protein receptor respon- 
sible for TeNT binding and internalization at the 
NMJ as well as a potential mechanism to explain 
the overlap between the entry routes of TeNT 
and BoNTs (fig. S9). Engineered growth factors 
with higher affinity for the extracellular matrix 
(ECM) display enhanced biological activity (27), 
indicating that differential binding of such lig- 
ands to ECM components is crucial for their 
signaling and biological function (28). TeNT 


Fig. 4. The N1 peptide blocks tetanic paralysis
by inhibiting TeNT binding at the NMJ. 
(A) LAL muscles were incubated with HA-HcT, 
which had been pretreated either with vehicle 
control (HcT alone), the N1 peptide, or the N1AA 
mutant in motor neuron culture medium. Muscles
were then immunostained for HA and α-BTx 
to identify NMJs. The ratio of HcT-positive to 
HcT-negative NMJs was quantified and analyzed 
for significance (n = 3 independent experiments, 
six images per condition; NS, nonsignificant; **P < 
0.01; error bar, SD). (B) Footprint analysis of control
mice, or of mice after injection into the triceps
surae muscle of a sublethal dose of TeNT, 
or TeNT preincubated with the N1AA or N1 peptides.
Black circles show the paw prints from the 
noninjected left hindlimb and right frontlimb, 
and red circles mark the paw print of the TeNT-injected
right hindlimb. In control and TeNT+N1-injected
mice, the black and red circles overlap, 
whereas in TeNT- and TeNT+N1AA-injected mice, 
they are separated, indicating impairment in coordination
due to local tetanus. (C) The distance 
between the paw prints on the injected side was 
measured and tested for statistical significance 
(n = 2 independent experiments; data from two 
animals are shown; NS, nonsignificant; *P < 0.05; 
**P < 0.01; error bar, SD). (D) The TA muscle was injected with TeNT 
preincubated with either DMSO (vehicle control), N1AA, or N1 peptides. After 
anesthesia, the tendon of this muscle was attached to a tension sensor, and 
the peroneal nerve was stimulated. Maximal muscle force was measured, 
normalized to muscle weight, and tested for statistical significance in 
control versus injected muscle. The N1-treated group retained a significantly 
higher muscle force than that of the vehicle- or N1AA-treated groups (n = 6 
mice per group; **P < 0.01; error bar, SEM). (E) Age- and weight-matched 
nidogen-2 KO and wild-type mice were injected intraperitoneally with TeNT 
and monitored for up to 96 hours. The time of termination due to loss of 
righting reflex was plotted against the TeNT dose. Generalized tetanus in 
nidogen-2 KO animals was delayed at lower doses (1.5 ng/kg) compared 
with that in wild-type controls (n = 2 experiments, minimum two mice per 
group; **P < 0.01; error bar, SD). 
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might exploit a similar strategy based on a very 
efficient capture mechanism at specialized NMJ 
sites rich in nidogens, which may function to 
concentrate TeNT as well as physiological lig- 
ands, such as neurotrophic factors, to facilitate 
their uptake and sorting to axonal transport or- 
ganelles. At these sites, TeNT in complex with 
nidogens may interact with surface receptors 
known to bind nidogens, such as the protein phos- 
phatase LAR (29, 30). This specialized capture 
mechanism is likely to be indispensable to the 
host cell, and this enables TeNT to be lethal at 
extremely low concentrations. Our study sug- 
gests that nidogens are prime therapeutic targets 
for suppressing the uptake of TeNT at the NMJ 
and its access to the CNS, preventing its lethal 
effects. 

REFERENCES AND NOTES 

1. C. Montecucco, Curr. Top. Microbiol. Immunol. 195, 278 (1995).

2. K. Deinhardt et al., Neuron 52, 293-305 (2006).

3. C. Chen, Z. Fu, J. J. P. Kim, J. T. Barbieri, M. R. Baldwin, J. Biol. Chem. 284, 26569-26577 (2009).

4. S. Jayaraman, S. Eswaramoorthy, D. Kumaran, S. Swaminathan, Proteins 61, 288-295 (2005).

5. Q. Chai et al., Nature 444, 1096-1100 (2006).

6. R. Jin, A. Rummel, T. Binz, A. T. Brunger, Nature 444, 1092-1095 (2006).

7. R. P. A. Berntsson, L. Peng, L. M. Svensson, M. Dong, P. Stenmark, Structure 21, 1602-1611 (2013).

8. R. P. A. Berntsson, L. Peng, M. Dong, P. Stenmark, Nat. Commun. 4, 2058 (2013).

9. M. Dong et al., Science 312, 592-596 (2006).

10. L. Peng, W. H. Tepp, E. A. Johnson, M. Dong, PLOS Pathog. 7, e1002008 (2011).

11. R. M. Benoit et al., Nature 505, 108-111 (2014).

12. K. Mann et al., EMBO J. 8, 65-72 (1989).

13. M. S. Ho, K. Bose, S. Mokkapati, R. Nischt, N. Smyth, Microsc. Res. Tech. 71, 387-395 (2008).

14. J. Kruegel, N. Miosge, Cell. Mol. Life Sci. 67, 2879-2895 (2010).

15. L. Dong et al., Lab. Invest. 82, 1617-1630 (2002).

16. A. Rummel, S. Bade, J. Alves, H. Bigalke, T. Binz, J. Mol. Biol. 326, 835-847 (2003).

17. T. Binz, A. Rummel, J. Neurochem. 109, 1584-1595 (2009).

18. C. Montecucco, Trends Biochem. Sci. 11, 314-317 (1986).

19. A. Rummel et al., Proc. Natl. Acad. Sci. U.S.A. 104, 359-364 (2007).

20. M. A. Fox, M. S. Ho, N. Smyth, J. R. Sanes, Neural Dev. 3, 24 (2008).

21. B. L. Bader et al., Mol. Cell. Biol. 25, 6846-6856 (2005).

22. S. Mahrhold, A. Rummel, H. Bigalke, B. Davletov, T. Binz, FEBS Lett. 580, 2011-2014 (2006).

23. O. Suwan-apichon et al., Invest. Ophthalmol. Vis. Sci. 47, 133-139 (2006).

24. L. Restani et al., PLOS Pathog. 8, e1003087 (2012).

25. M. Matsuda, N. Sugimoto, K. Ozutsumi, T. Hirai, Biochem. Biophys. Res. Commun. 104, 799-805 (1982).

26. F. L. Yeh et al., PLOS Pathog. 6, e1001207 (2010).

27. M. M. Martino et al., Science 343, 885-888 (2014).

28. L. Macri, D. Silverstein, R. A. Clark, Adv. Drug Deliv. Rev. 59, 1366-1381 (2007).

29. E. Stryker, K. G. Johnson, J. Cell Sci. 120, 3723-3728 (2007).

30. P. O'Grady, T. C. Thai, H. Saito, J. Cell Biol. 141, 1675-1684 (1998).

ACKNOWLEDGMENTS 

We thank S. Kjaer (Cancer Research UK London Research
Institute) for help with the Octet RED96 system and Kd
determination; N. O'Reilly and members of the Protein and Peptide
Chemistry Laboratory (Cancer Research UK London Research
Institute) for peptide synthesis, purification, and analysis;
F. Giribaldi (Cancer Research UK London Research Institute) for
help with footprint analyses; K. Deinhardt (University of
Southampton) for mass spec analysis of axonal signaling
endosomes; S. Swaminathan (Brookhaven National Laboratory) for
experiments aiming at the crystallization of the N1 peptide-HcT
complex; A. Rummel and T. Binz (Medizinische Hochschule
Hannover) and N. Fairweather (Imperial College London) for
providing access to published HcT mutants; I. Koxholt (University
of Cologne) for help with nidogen-1 and -2 KO mice; and
J. N. Sleigh (University College London) for help with statistical
analyses. We also thank C. Montecucco (University of Padua)
and S. Novoselov (University College London) for constructive
comments. This work was supported by Cancer Research UK (K.B.,
N.S., M.G., M.W., and G.S.), by the Medical Research Council
(M.W.), by the University of Padua (P.C. and G.Z.), and by the
Deutsche Forschungsgemeinschaft (SFB 829) through the
Collaborative Research Center at the University of Cologne and
the "Köln Fortune Programm" (R.N.). L.G. is the Graham Watts
Senior Research Fellow, supported by the Brain Research Trust.
The data presented in this manuscript are found in the main paper
and the supplementary data and materials and methods. The
authors declare that they have no conflict of interest.

SUPPLEMENTARY MATERIALS 

www.sciencemag.org/content/346/6213/1118/suppl/DC1

Materials and Methods

Figs. S1 to S9

Tables S1 to S4

References (31-38)

Movies S1 to S3

30 June 2014; accepted 5 November 2014
10.1126/science.1258138


T CELL SIGNALING 


Antigen affinity, costimulation, and 
cytokine inputs sum linearly to 
amplify T cell expansion 

Julia M. Marchingo, Audrey Kan, Robyn M. Sutherland, Ken R. Duffy,
Cameron J. Wellard, Gabrielle T. Belz, Andrew M. Lew, Mark R. Dowling,*
Susanne Heinzel,* Philip D. Hodgkin*†

T cell responses are initiated by antigen and promoted by a range of costimulatory signals. 
Understanding how T cells integrate alternative signal combinations and make decisions 
affecting immune response strength or tolerance poses a considerable theoretical challenge. 
Here, we report that T cell receptor (TCR) and costimulatory signals imprint an early, 
cell-intrinsic, division fate, whereby cells effectively count through generations before returning 
automatically to a quiescent state. This autonomous program can be extended by cytokines. 
Signals from the TCR, costimulatory receptors, and cytokines add together using a linear 
division calculus, allowing the strength of a T cell response to be predicted from the sum of the 
underlying signal components. These data resolve a long-standing costimulation paradox and 
provide a quantitative paradigm for therapeutically manipulating immune response strength. 


Upon infection, pathogen-specific CD8+ T cells
undergo a characteristic kinetic sequence:
rapid proliferation and expansion followed
by population contraction due to cell death
(1). While short-term stimulation is sufficient
to trigger T cell proliferation (2, 3),
further exposure to stimulatory signals is required
for an effective response (2, 4, 3). Although multiple
attempts have been made to create a theory
relating the integration of stimulatory signals to
T cell response strength, all have been qualitative
(6) and thus have lacked the power to predict the
quantitative effect of altering stimulatory combinations
and strength. The current qualitative paradigm
describes T cell activation and response
magnitude as the outcome of three requisite signals:
signal 1, T cell receptor (TCR) (1); signal 2,
membrane-bound antigen-presenting cell (APC)-delivered
costimuli (4); and signal 3, cytokines
from inflammatory, homeostatic, or autocrine
sources (5). The importance of these signals for
T cell expansion is highly context-dependent, as
classic in vitro studies have identified many “critical”
signal 2 and 3 molecules (5, 7-9); however,
gene deletion typically yields only moderate defects
in the in vivo CD8+ T cell response (10-13),
implying considerable redundancy.

Recent studies in B cells report an automated 
return to quiescence after a series of division 
rounds (14-16). The number of mitotic cycles B 
cells undergo varies and is influenced by the 
strength of stimulation. We hypothesized that
T cells might be programmed in a similar manner,
with the final number of divisions (N) (Fig. 1A,
left panel) a function of the sum of inputs from
signals 1, 2, and 3 (illustrated in Fig. 1A). If
correct, it may be possible to determine the calculus
of addition to serve as the basis for a quantitative 
framework for T cell costimulation. 

To test this hypothesis, we first measured the
onset of quiescence in CD8+ T cells using TCR-transgenic
OT-I mice [which recognize chicken
ovalbumin SIINFEKL (N4) peptide bound to
H2Kb] crossed with FucciRG mice in which cells
fluoresce red (FucciR) during G0/G1 and green
(FucciG) for the duration of the S/G2/M cell
cycle phases (17). Quiescent (FucciR+G-), recently
divided (FucciR-G-), and actively dividing (FucciG+)
cells can be distinguished, because cells that have
reverted to a quiescent state (G0) accumulate higher
levels of FucciR (17, 18). We define the number
of generations of division before returning to
quiescence as the cell’s division destiny (DD)
(14-16, 19). OT-I/FucciRG CD8+ T cells were
transferred into mice infected with recombinant
HKx31 influenza virus expressing N4 (HKx31-N4)
(20). During early expansion, most OT-I/FucciRG
CD8+ T cells were proliferative, with <10% reverting
to a quiescent state by day 3 of the response.
This proportion of quiescent cells increased steadily,
reaching ~75% of all OT-I/FucciRG CD8+ T cells
by the onset of contraction at day 7 (Fig. 1, B and C).
To estimate the number of divisions T cells underwent
before dropping out of cycle, the Cyton model
(15, 21) was fitted to total and quiescent cell numbers
(Fig. 1D and table S1). Results were consistent
with a T cell DD range spanning ~10 generations
(Fig. 1E). If DD is carried through each cell
lineage (16), this result predicts up to 1000-fold
differences in T cell family size, consistent with
recent single-cell tracking studies (22, 23).
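
The 1000-fold figure follows from doubling arithmetic: founders whose lineages stop dividing 10 generations apart differ in family size by 2^10. The short Python sketch below illustrates this under two simplifying assumptions that are ours, not the authors': every division doubles the family, and no cell dies. The function names are hypothetical and this is not the fitted Cyton model.

```python
def family_size(division_destiny: int) -> int:
    """Cells descended from one founder after `division_destiny` rounds,
    assuming each round doubles the family and no cell dies."""
    if division_destiny < 0:
        raise ValueError("division destiny must be non-negative")
    return 2 ** division_destiny


def fold_difference(dd_low: int, dd_high: int) -> float:
    """Ratio of family sizes for two founders with different destinies."""
    return family_size(dd_high) / family_size(dd_low)


# A spread of 10 generations between the least and most prolific founders.
print(fold_difference(0, 10))  # 1024.0, i.e. roughly 1000-fold
```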

To further explore the regulation of division
progression, we developed a minimal in vitro stimulation
system using CellTrace Violet (CTV)-labeled
OT-I/FucciRG CD8+ T cells. The contribution of
signals 2 and 3 was reduced by using peptide
self-presentation by purified CD8+ T cells (24). The
strong effect of autocrine interleukin-2 (IL-2) was
controlled by adding blocking antibody (clone
S4B6) and using human IL-2 (hIL-2), resistant to
S4B6, when required (25). Superficially, the in vitro
pattern of early proliferation with a gradual onset
of quiescence recapitulated the in vivo response
(Fig. 1, C and F), with the major differences
being the DD and subsequent time to die (Fig. 1,
D, E, and G, and table S1).

We speculated that the three known sources of
regulation (TCR affinity, costimuli, and cytokines)
might combine to convert the low DD observed
in vitro into the extensive outcome possible in vivo
(Fig. 1E).

To improve estimation of mean DD (mDD)
in our in vitro CTV division tracking assay, we
used OT-I CD8+ T cells deficient in the proapoptotic
molecule Bim (OT-I/Bcl2l11-/-) for all
experiments. These cells reported the same mDD
as OT-I/Bcl2l11+/+ CD8+ T cells (Fig. 2A; fig. S1, A
to C; and table S2), but the enhanced survival
upon reverting to quiescence facilitated DD measurement
at later times, consistent with previous
studies in B cells (15, 16).

Fig. S2, A to F, shows the effect of a range of
T cell stimuli on mDD. TCR affinity, several agonists
representative of cell-contact-mediated costimulation,
and some, but not all, cytokines tested
were able to regulate DD in a dose-dependent
manner. To determine when DD was most susceptible
to regulation, cells from cultures where
stimulation was removed immediately before the
first division (24) were compared to cells with
constant costimulation. Agonist antibodies to
CD28 and CD27 principally acted before the
first division. In contrast, ongoing exposure to
IL-2, IL-4, and IL-12 was required for maximal
proliferation (Fig. 2B). Higher levels of IL-2 or
IL-4 caused T cells to divide beyond CTV resolution
(fig. S3) and the culture capacity. Therefore,
to investigate the potential of IL-2 and IL-4
to extend DD when cytokine levels were maintained,
OT-I/Bcl2l11-/- CD8+ T cells were subcultured
every 48 hours in hIL-2 (Fig. 2C) or IL-4
(fig. S4A), and total cell numbers were calculated
using splitting ratios. Cyton fitting revealed that
hIL-2 and IL-4 can increase the mDD by up to
~11 and 7 divisions, respectively (Fig. 2, C to E;
fig. S4, A and B; and tables S3 and S4). Titration
of hIL-2 showed the effect on mDD to be dose
dependent (Fig. 2D) and that this increase in
mDD was associated with an increase in variance
(Fig. 2E and fig. S4C). Together these results
demonstrated that DD can be intrinsically
programmed by early signals, but also has the
flexibility to be “reprogrammed” or extended by
extrinsic stimuli as the T cells divide.

We then determined how T cells integrated 
multiple contributors to DD. In Fig. 3, A and B, 
we show the increase in mDD imprinted before 
the first division for low concentrations of CD28 
and CD27 agonist antibodies, and IL-12 protein. 
The combination of antibodies to CD28 and CD27 
programmed an mDD that was equivalent to the 
sum of each individual effect, with IL-12 giving a 
slightly greater than additive increase in mDD 


Fig. 1. CD8+ T cells undergo a program of proliferation
and quiescence in vivo and in vitro. (A) Quantitative
T cell expansion hypothesis. By this model,
the number of mitotic cycles a T cell undergoes after
activation (N) varies and is determined by a sum of
the individual inputs it receives. In the example shown,
signal 1, 2, and 3 stimuli each individually elicit a small
increase in mean population division number. The cumulative
effect of these contributions, when summed
linearly, would lead to geometric increases in total cell
number at the peak response. Analysis of OT-I/FucciRG
CD8+ T cells [(B) to (E)] transferred to HKx31-N4-infected
recipients on day 2 after infection or [(E) to
(G)] in vitro stimulated with N4 in the presence of mIL-2
blocking antibody (S4B6). (B) Number and (C) percentage
of OT-I/FucciRG CD8+ T cells expressing FucciR
and FucciG reporter proteins pooled from mediastinal
lymph node, spleen, and lungs at the indicated time
points after transfer. (D) Fitted total (black) and quiescent
(red, FucciR+G- + small FucciR-G- cells) cell numbers
using the Cyton model (19). (E) The estimated DD
distribution from Cyton fitting to in vivo (D) and in vitro
(G) stimulated cells. (F) Percentage of FucciR and FucciG
expression. (G) Fitted total (black) and quiescent (red,
FucciR+G- + small FucciR-G- cells) cell numbers (19).
[(B) to (E)] n = 5 to 10 mice per time point, pooled
from two independent experiments; mean ± SEM. [(E)
to (G)] Representative of three independent experiments;
mean ± SEM from triplicate culture wells.

(Fig. 3, C and D). Importantly, no single “second
signal” appeared obligatory, but rather multiple
small arithmetic effects on DD culminated in
large geometric differences in the cell numbers
produced (Fig. 1A, center and right panels). Thus,
an increase of ~2.2 divisions in mDD (Fig. 3D)
with the accompanying ~0.5 division increase in
standard deviation (fig. S5, A to D, and table S5) 
summed from three weak costimuli resulted in 
a net ~8-fold increase in the peak cell number 
(Fig. 3, E and F), with the additional difference 

Fig. 2. Costimuli and cytokines program changes to
DD. (A) CTV-labeled OT-I/Bcl2l11+/+ or OT-I/Bcl2l11-/-
CD8+ T cells were stimulated with N4; total cell numbers
(left), mean division number (middle), and an estimation
of the percentage of the starting cells whose
progeny are contributing to the response at that time
point, calculated by removing the effect of cell expansion
at each time point [percent cohort number, right;
as described in (19)] were determined. Mean DD (mDD)
on each graph is indicated by dotted lines. (B) Mean
division number of CTV-labeled OT-I/Bcl2l11-/- CD8+ T
cells cultured in the presence (+, solid lines) or absence
(-, dotted lines) of antibodies to CD28 (2 µg/mL) and
CD27 (immobilized, 5 µg/mL) (blue), IL-12 (1 ng/mL),
human IL-2 (hIL-2, 31.6 U/mL), or IL-4 (1000 U/mL)
(red) for 26 hours, washed and further cultured with
(dark) or without (light) costimulation. (C) CTV-labeled
OT-I/Bcl2l11-/- CD8+ T cells stimulated with N4 in the
presence of hIL-2 at the indicated concentrations were
subcultured into fresh hIL-2 every ~48 hours and the
cell number was fitted using the Cyton model (dotted
lines) (19). (D and E) Dose-response curve of mDD (D)
and DD distributions (E) from Cyton fitting in (C). All
cultures contained 25 µg/mL S4B6. Representative of
at least two independent experiments; mean ± SEM
from triplicate culture wells.


Fig. 3. Summation of DD from multiple costimuli
geometrically amplifies the T cell response. Percentage
cohort number (Fig. 2A) (19) versus mean
division number for N4-stimulated, CTV-labeled
OT-I/Bcl2l11-/- CD8+ T cells cultured with (A) antibodies
to CD27 (immobilized, 5 µg/mL) and CD28 (2 µg/mL)
or (B) IL-12 protein (1 ng/mL) alone, (C) antibodies to
CD27 and CD28 together, or (D) a combination of all
three costimuli for 26 hours, washed and recultured
without further stimulation. Relationship between
cell number and either (E) mean division number or
(F) time for data in [(A) to (D)]. Arrows represent the
effect of individual stimuli on mDD. All cultures contained
S4B6 at 25 µg/mL. Graphs are representative
of three independent experiments; mean ± SEM of
triplicate culture wells.


in response magnitude attributable to small 
variations in the starting cell number (fig. S5E 
and table S5). Early programming was cell 
intrinsic because cells imprinted with an mDD of 
~1 or ~3.4 generations gave the same outcome 
irrespective of whether they were subsequently 
cultured separately or together (fig. S6, A and B). 
The approximately additive effect of stimuli on 
DD also applied for a range of combinations when 
stimuli were retained in culture during subse- 
quent division rounds (fig. S7, A to D). 
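
The step from small additive changes in DD to large changes in cell number can be illustrated numerically. The Python sketch below is only an illustration under assumptions that are ours: DD is treated as normally distributed, each division doubles the lineage, and cell death and starting-number variation are ignored. The individual increments (0.7, 0.6, 0.9) are hypothetical values chosen to sum to the ~2.2 divisions quoted in the text; they are not measurements from the paper.

```python
import math
from typing import Sequence


def summed_destiny(baseline: float, increments: Sequence[float]) -> float:
    """Mean division destiny after adding each stimulus's contribution."""
    return baseline + sum(increments)


def expected_fold_expansion(mean_dd: float, sd_dd: float = 0.0) -> float:
    """Expected cells per founder, E[2**DD], for DD ~ Normal(mean_dd, sd_dd**2).

    Uses the lognormal mean: E[exp(X ln 2)] = 2**mean * exp((ln 2 * sd)**2 / 2).
    """
    return 2.0 ** mean_dd * math.exp((math.log(2.0) * sd_dd) ** 2 / 2.0)


# Hypothetical per-stimulus increments in mDD (illustrative values only).
increments = [0.7, 0.6, 0.9]
gain = summed_destiny(0.0, increments)

# Fold change in cell number from the shift in the mean alone.
print(round(expected_fold_expansion(gain), 1))  # 4.6
```

Under these assumptions, a wider DD distribution raises the expected cell number further, because the most prolific lineages dominate the total; `expected_fold_expansion(2.2, 0.5)` is larger than `expected_fold_expansion(2.2)`.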


Taken together, this series of experiments re- 
veals two stages of regulation of T cell DD. In the 
first stage, signal 1 and a series of signal 2 and 3 
stimuli of different strengths and combinations 
can additively “program” a heritable number of 
division rounds before the first cell division. In 
the second stage, exposure to external signals, 
mainly cytokines, can be processed and added to 
the DD. These features are consistent with a molecular
mechanism whereby each stimulatory signal
contributes a quantum of mitosis-promoting

Fig. 4. In vivo summation of DD can predict T cell
expansion. (A) Ratio of OT-I/Il2ra+/+:OT-I/Il2ra-/- CD8+
T cells recovered from mLN, spleen, and lungs after
cotransfer of equal numbers of each into HKx31-N4-infected
recipients. Mean ± SEM, n = 5 to 10 mice per
time point, pooled from two independent experiments;
two-way analysis of variance (ANOVA). (B and C)
CTV-labeled OT-I/Il2ra+/+ and OT-I/Il2ra-/- CD8+ T
cells were cotransferred to recipient mice at a 30:70
ratio 1 day before engraftment under the renal capsule
with islets expressing membrane-bound ovalbumin
under the rat insulin promoter. Mice were pulsed with
BrdU 1 hour before organ harvesting on day 6 after
engraftment, and the ratio of OT-I/Il2ra+/+:OT-I/Il2ra-/-
of (B) divided CD8+ T cells and (C) the percentage
of divided CD8+ T cells positive for BrdU within their
respective genotype populations was measured in
the renal (draining) lymph node (rLN) and graft.
Mean ± SEM, n = 4 mice, representative of two independent
experiments; two-way ANOVA and one-tailed t
test, respectively. For all experiments, data points were
excluded from ratio and percentage calculations when
fewer than 100 cells were detected. Dotted lines in (A) and
(B) represent the OT-I/Il2ra+/+:OT-I/Il2ra-/- transfer ratio. (D) The Cyton model
was fitted to OT-I/Il2ra+/+ and OT-I/Il2ra-/- CD8+ T cell numbers from HKx31-Q4-infected
mice and OT-I/Il2ra-/- CD8+ T cells from HKx31-N4-infected mice pooled
from the mLN, spleen, and lungs after cotransfer of equal numbers of cells into
recipient mice, and (E) the division destiny distribution was determined (top) (19).
By summation of mean and variance of these DD distributions, the cumulative
effect of increasing TCR-antigen affinity and IL-2 signaling on the DD distribution
(i.e., the DD distribution for OT-I/Il2ra+/+ CD8+ T cells in HKx31-N4-infected mice)
was predicted (bottom). (F) Using this mean and variance, the cell number over
time was predicted for OT-I/Il2ra+/+ CD8+ T cells in HKx31-N4-infected mice (19).
Mean ± SEM; n = 5 mice per time point, representative of two independent
experiments. ****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05.


protein or complex that is diluted by division until 
a submitotic concentration is reached and division 
ceases (26). 

The two-stage DD programming model makes
two key predictions for the role of extrinsic factors,
such as IL-2, that could be tested during in
vivo CD8+ T cell responses: (i) the major physiological
role of autocrine IL-2 is in maintaining
division, and therefore it will be more important
away from the initial site of CD8+ T cell priming;
and (ii) the effect of IL-2 on DD will sum with
other stimuli, allowing the prediction of CD8+ T
cell expansion kinetics when IL-2 and other
stimuli are combined.

We tested the first prediction by comparing the
expansion of IL-2 receptor α-deficient OT-I CD8+ T
cells (OT-I/Il2ra-/-) with OT-I/Il2ra+/+ CD8+ T cells
in two different in vivo systems, namely an anti-influenza
response and an islet graft rejection model.
Similar numbers of OT-I/Il2ra+/+ and OT-I/Il2ra-/-
CD8+ T cells were detected at the site of
priming [mediastinal lymph node (mLN)] during
the expansion phase when cotransferred into
HKx31-N4-infected mice (Fig. 4A). In contrast, a
bias toward expansion of OT-I/Il2ra+/+ CD8+ T
cells was observed in the spleen and lungs, consistent
with a role for IL-2 in the maintenance of
cell expansion (Fig. 4A). OT-I/Il2ra+/+ CD8+ T cells
also outcompeted OT-I/Il2ra-/- CD8+ T cells at the
effector site during an anti-islet graft response
(Fig. 4B). The proportion of bromodeoxyuridine-positive
(BrdU+) OT-I/Il2ra+/+ CD8+ T cells after
a 1-hour in vivo BrdU pulse was ~2.5 times as
high as for OT-I/Il2ra-/- CD8+ T cells in the graft
(Fig. 4C), confirming that this bias was attributable
to proliferation in the effector site and not
due to migration alone (27).


To investigate the additive nature of T cell
stimuli in vivo, OT-I/Il2ra+/+ and OT-I/Il2ra-/- CD8+
T cells were cotransferred into mice infected with
either high-affinity (HKx31-N4) or low-affinity
(HKx31-Q4) influenza virus (20). The Cyton model
was fitted to T cell numbers to estimate the increase
in DD due to TCR affinity or IL-2 signaling
alone (Fig. 4, D and E, top, and table S6). By
summation of these individual contributions to
the mean and variance of the DD distribution, we
predicted the effect of a combined increase in TCR
affinity and IL-2 signaling on DD (Fig. 4E, bottom)
and successfully recreated the expansion kinetics
of OT-I/Il2ra+/+ CD8+ T cells during an HKx31-N4
infection (Fig. 4F).
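
The summation step described above can be sketched as follows. This Python example assumes, as a simplification of ours, that each stimulus adds a fixed increment to both the mean and the variance of the DD distribution relative to a shared baseline (low-affinity antigen, no IL-2 receptor α). The class and function names are hypothetical, and the numbers are placeholders rather than the fitted values in table S6.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DDDistribution:
    """Mean and variance of a division destiny distribution (in generations)."""
    mean: float
    variance: float


def predict_combined(baseline: DDDistribution,
                     stimulus_a: DDDistribution,
                     stimulus_b: DDDistribution) -> DDDistribution:
    """Add the increments each single stimulus contributes over the baseline."""
    mean = (baseline.mean
            + (stimulus_a.mean - baseline.mean)
            + (stimulus_b.mean - baseline.mean))
    variance = (baseline.variance
                + (stimulus_a.variance - baseline.variance)
                + (stimulus_b.variance - baseline.variance))
    return DDDistribution(mean, variance)


# Placeholder values, not data from the paper.
low_affinity_no_il2 = DDDistribution(mean=5.0, variance=1.0)   # baseline
high_affinity_only = DDDistribution(mean=7.0, variance=1.5)    # affinity alone
il2_signaling_only = DDDistribution(mean=6.5, variance=2.0)    # IL-2 alone

predicted = predict_combined(low_affinity_no_il2,
                             high_affinity_only,
                             il2_signaling_only)
print(predicted)  # DDDistribution(mean=8.5, variance=2.5)
```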

Manipulating T cell responses by costimulation
and cytokine signaling is an important emerging
therapeutic regimen (28, 29), and a quantitative
framework will facilitate the rational development
of optimal interventions. To support this
goal, we propose a quantitative paradigm where
no one signal is obligatory but rather combinations
of inputs add together to geometrically enhance
outcomes (Fig. 1A). Thus, combinations of
different costimulatory and cytokine signals provide
many alternative paths to generate T cell
responses of similar magnitude. This framework 
reconciles long-standing discrepancies between 
in vivo and in vitro results for IL-2 and costim- 
ulatory signals and reveals a quantitative basis 
for current switch-inspired two- and three-signal 
models of activation. Further studies measuring 
simultaneous differentiation changes to effector 
and memory states associated with cell division 
would complete the T cell calculating framework
and further enhance our ability to predict therapeutic
strategies for immunomodulation.


Mechanism of DNA interstrand
nuclease FAN1

DNA interstrand cross-links (ICLs) are highly toxic lesions associated with cancer and
degenerative diseases. ICLs can be repaired by the Fanconi anemia (FA) pathway and
through FA-independent processes involving the FAN1 nuclease. In this work, FAN1-DNA
crystal structures and biochemical data reveal that human FAN1 cleaves DNA successively
at every third nucleotide. In vitro, this exonuclease mechanism allows FAN1 to excise an ICL
from one strand through flanking incisions. DNA access requires a 5'-terminal phosphate
anchor at a nick or a 1- or 2-nucleotide flap and is augmented by a 3' flap, suggesting that
FAN1 action is coupled to DNA synthesis or recombination. FAN1's mechanism of ICL
excision is well suited for processing other localized DNA adducts as well.


DNA interstrand cross-links (ICLs) covalently
link together the two strands of the
double helix (1). ICLs can be repaired by
the Fanconi anemia (FA) pathway of proteins
mutated in the FA cancer predisposition
syndrome (2) and also by FA-independent
processes (2, 3, 4). The FA pathway is activated
when a replication fork stalls upon encountering
an ICL, leading to unhooking of the ICL
through flanking incisions on one strand, translesion
DNA synthesis across the unhooked ICL,
removal of the ICL remnants by additional incisions,
and homologous recombination (2, 5).
The FAN1 nuclease (6-9) is a candidate for mediating
FA-independent repair (1, 3, 4). Although
FAN1 mutations result in defective ICL repair,
chromosomal instability, and hypersensitivity to
a wide range of ICL-inducing agents (6-13), they
do not cause FA (10-12). Instead, they cause a kidney
degeneration disorder known as karyomegalic
interstitial nephritis (11). FAN1 cleaves branched
DNA structures with a preference for 5'-flap substrates
and also exhibits 5'- to 3'-exonuclease activity
on a variety of double-stranded DNA (dsDNA)
and single-stranded DNA (ssDNA) substrates (6-9).
The functional implications of these broadly defined
activities, whether FAN1 processes ICLs directly
and in which manner, and the structural
mechanism of ICL unhooking by nucleases are
unknown.

To address these questions, we first investigated
FAN1’s DNA binding specificity for various
lengths of 5' flaps consisting of thymidine
nucleotides (see supplementary materials and
methods). We found that FAN1 exhibits specificity
for a 5' phosphate group at a flap of 1 to 2
nucleotides (nts) or at a nick, with dissociation
constants (Kd) of 10.3 nM for the 1-nt flap (5'pT1),
121 nM for 2 nts (5'pT2), and 182 nM for the nick
(pN1) (fig. S1A). The corresponding substrates
lacking the 5' phosphate group fail to bind appreciably
until micromolar FAN1 concentrations
(fig. S1A). Similarly, increasing the flap length
beyond 2 nts reduces FAN1 affinity substantially
(fig. S1A). We also tested the effect of adding a
3' flap of eight thymidine nucleotides to the high-affinity
5'pT1 (5'pT1/3'T8), 5'pT2 (5'pT2/3'T8), and
5'pN1 (5'pN1/3'T8) and found that this increased
their FAN1 affinity by a factor of ~25 to ~180
(Kd values of 0.4, 1.4, and 1.0 nM, respectively)
(fig. S1B). By contrast, adding a 3'T8 flap to the
low-affinity 5'pT8 (5'pT8/3'T8) minimally enhanced
its affinity, with the resulting Kd of 377 nM
a factor of ~1000 weaker than that of 5'pT1/3'T8
(fig. S1B).
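
The quoted fold changes can be checked directly from the dissociation constants given in the text. The Python sketch below does only that arithmetic. The substrate labels follow the naming used above, and the dictionary and function names are our own.

```python
# Dissociation constants (nM) as quoted in the text for each DNA substrate.
KD_NM = {
    "5'pT1": 10.3,
    "5'pT2": 121.0,
    "5'pN1": 182.0,
    "5'pT1/3'T8": 0.4,
    "5'pT2/3'T8": 1.4,
    "5'pN1/3'T8": 1.0,
    "5'pT8/3'T8": 377.0,
}


def fold_change(kd_weaker_nm: float, kd_tighter_nm: float) -> float:
    """How many times tighter the second substrate binds than the first."""
    if kd_weaker_nm <= 0 or kd_tighter_nm <= 0:
        raise ValueError("dissociation constants must be positive")
    return kd_weaker_nm / kd_tighter_nm


# Affinity gain from adding the eight-thymidine 3' flap.
for name in ("5'pT1", "5'pT2", "5'pN1"):
    gain = fold_change(KD_NM[name], KD_NM[name + "/3'T8"])
    print(f"{name}: 3' flap tightens binding about {gain:.0f}-fold")

# Long versus short 5' flap, both carrying the 3' flap.
penalty = fold_change(KD_NM["5'pT8/3'T8"], KD_NM["5'pT1/3'T8"])
print(f"5'pT8/3'T8 binds about {penalty:.0f}-fold weaker than 5'pT1/3'T8")
```

The three gains come out at roughly 26-, 86-, and 182-fold, matching the stated range of about 25 to 180, and the long-flap penalty is a little under 1000-fold.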

Based on these findings, we determined the
2.9 Å structure of a 5'pG1/3'T4 DNA bound to
human FAN1 (residues 364 to 1017), N-terminally
truncated to remove the ubiquitin-binding UBZ
domain and subsequent unstructured segment,
as well as the structure of apo-FAN1 (fig. S2A and
table S1). FAN1 has a bilobal structure consisting
of a 223-residue N-terminal domain (NTD) and a
415-residue C-terminal domain (CTD) that contains
the PD-(D/E)XK nuclease motif (Fig. 1, A
and B). The DNA adopts a V-shaped structure,
with a kink at the nick. The prenick and postnick
duplexes have an overall B-form DNA (B-DNA)
conformation, with a 76° angle between them.
The prenick duplex and 3' flap are bound by the
NTD, whereas the 5' flap and postnick duplex are
bound by the CTD. There are no major conformational
changes between the apo- and DNA-bound
FAN1 structures (fig. S2B).

The NTD consists of a helical domain, a winged-helix
DNA binding domain, and the predicted SAP
domain (Fig. 1C). These form a continuous surface
that binds to a 9-base pair duplex segment leading
to the 3' flap and to a 2-nt segment of the flap
(Fig. 1A and fig. S3, A to D). The duplex contacts
involve only phosphodiester groups (fig. S3, A to
D), except for the flap-proximal base pair, which is
contacted at both its phosphodiester and base
groups (Fig. 2A). These base contacts block the
DNA from extending as a regularly stacked duplex,
analogous to the helix-breaking wedges observed
with other structure-specific nucleases (14, 15).

Only the first two nucleotides of the 3'T4 flap
are ordered. These extend away from the duplex
and are bound by FAN1 (Fig. 2A). The base and
ribose groups of the first nucleotide are contacted
by Val®”, and Arg®®. The phosphate group
of the second nucleotide binds to a pocket at the
N terminus of an α helix and hydrogen bonds to
a backbone amide group (Tyr^^'*’) in a partially
buried environment. These 3'-flap contacts are
consistent with the addition of a single-nucleotide
3' flap to 5'pT1 resulting in roughly half of the
affinity increase of an 8-nt flap (for 5'pT1/3'T1,
Kd = 1.9 nM) (fig. S1B).

The CTD starts with a 180-residue helical
repeat domain that belongs to the tetratricopeptide
repeat (TPR) family, followed by a nuclease
domain that has the ~120-residue αβββαβ
fold of the PD-(D/E)XK superfamily of Mg2+-dependent
nucleases (16, 17) but that additionally
contains a 125-residue helical insertion (Fig. 1C).
The CTD has a wide and deep groove, with the
TPR domain on one side and the nuclease and
insertion domains on the other, into which the
postnick duplex binds (Fig. 1B). FAN1 contacts
the 5' phosphate group of the flap and eight phosphodiester
groups clustered at the flap-distal portion
of the duplex (Fig. 2B). The backbone of the
1-nt flap extends orthogonally from the duplex.
The 5' phosphate group abuts a highly basic pocket
at the bottom of the CTD groove, with its three
nonbridging oxygen atoms hydrogen bonding
with the side chains of Arg™®, Arg®“, His^''’^, and 
Lys®®® (Fig. 2, C and D, and fig. S4A). These con- 
tacts explain FANl’s strict requirement for a 5' 
phosphate group. Accordingly, the Arg™® — ► 
Ala™®/Arg®®^ — > Ala®®^ double mutation reduces 
FAN1’s affinity for 5'pT1/3'T8 by a factor of ~400
(Kd = 210 nM) (fig. S4B). After the 5' phosphate
group, the next two phosphodiester groups make
relatively few contacts, whereas a nonbridging
oxygen of the third phosphodiester group coordinates
the Ca2+ ion at the nuclease active site
(Fig. 2D).

The Ca2+ ion, which does not support nuclease 
activity (18), has a coordination shell similar 
to other nucleases (17, 18) (fig. S4, C to E). 
This indicates that cleavage occurs at the third 
phosphodiester group from the 5' phosphate 
(hereafter “+3 register”) and results in a 5' phosphorylated 
product, both of which we confirmed 
by liquid chromatography-coupled mass spectroscopy 
(LC-MS) of a nuclease reaction. 

In additional cocrystal structures that we 
determined (table S1 and fig. S5A), the 5' phosphate 
groups of 5'pT1/3'T8, 5'pT1, and 5'pNi bind 
to the same FAN1 pocket and contact the same 
four side chains as in the 5'pG1/3'T4 structure 
(fig. S5, A and C). By contrast, the pocket is 
devoid of DNA in the cocrystal structure of the 
5'-hydroxyl 5'T2, consistent with FAN1’s specificity 
for a terminal phosphate. The 5'pT1/3'T8, 5'pT1, 
and 5'pNi structures also suggest some inherent 
flexibility in the register of the scissile phosphate 
relative to the 5' phosphate group. In the 5'pT1/3'T8 
and 5'pT1 structures, an extra nucleotide is extruded 
out in a loop as the 5'-flap backbone transitions 
to the duplex, resulting in a +4 scissile 
phosphate register (fig. S5, B and C). In the 5'pNi 
structure, a shifted trajectory for the duplex results 
in a +2 register (fig. S5, C and D). 

The importance of a 5' phosphate group on a 
short flap or nick is recapitulated in FAN1 nuclease 
assays. In the presence of 150 mM NaCl 
and with 100 nM substrate, 5'pT1/3'T8, pNi/3'T8, 
and 5'pT2/3'T8 are cleaved starting with 0.1 nM 
FAN1, whereas the longer 5'-flap 5'pT8/3'T8 substrate 
is not noticeably cleaved even with 100 nM 
FAN1 (Fig. 3A and fig. S6). 5'pT8/3'T8 cleavage required 
very low ionic strength, such as the 25 mM 
NaCl used in previous studies, but its extent was 
still substantially lower than that of 5'pT1/3'T8 
(Fig. 3A and fig. S6). 

Fig. 1. Overall structure of FAN1-5'pG1/3'T4. (A) Cartoon representation. NTD, pink; CTD, light 
blue; dsDNA, yellow; flaps and 5' phosphate spheres, red; and Ca2+, purple. The inset shows a line 
drawing of the DNA. (B) View looking down the vertical axis of (A). (C) The six different domains of 
FAN1 are colored according to the linear representation. WH, winged helix. 

Fig. 2. FAN1-DNA contacts. (A) Close-up view of 3'-flap contacts showing side chains involved in 
either hydrogen bond (green dotted lines) or van der Waals contacts. T, Thr; V, Val; R, Arg; L, Leu; E, Glu; Y, 
Tyr; P, Pro. (B) Overview of the CTD-DNA contacts, with the 5' flap and its phosphate group shown in red. D, 
Asp; K, Lys; H, His. (C) Surface representation of the 5' phosphate binding pocket, colored by the 
electrostatic potential (±7 kT, where k is Boltzmann's constant and T is temperature), with the scissile strand 
shown as yellow sticks, the 5' phosphate oxygen atoms in red, and the continuous strand as thin yellow lines. 
(D) Close-up view of contacts to the 5' phosphorylated flap and the three following nucleotides of the scissile 
strand. Continuous strand shown as thin lines. G, Gly. 

The +3 scissile phosphate register suggests 
that FAN1’s reported 5'-to-3' exonuclease activity 
results from successive incisions releasing 3-nt 
fragments. In the time course of a 1 μM stoichiometric 
FAN1-5'pT1/3'T8 reaction, an 18-nt product 
corresponding to a +3 incision appears first, its 
level peaking at 8 min and then declining as it 
gets consumed (Fig. 3B). A 15-nt product (two +3 
incisions) appears second and peaks at 16 to 
32 min, whereas a 12-nt product (third incision) 
appears last. The 5'pNi/3'T8 substrate produces 
a similar pattern of sequential cuts with a major 
+3 spacing (Fig. 3B). Minor products corresponding 
to +4 and +2 incisions probably reflect the 
alternate scissile phosphate registers of the 
5'pT1/3'T8 and 5'pNi cocrystal structures, respectively 
(fig. S5C). 
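
The sequence of products described above follows from simple arithmetic, which the following minimal sketch illustrates. It assumes a 21-nt labeled scissile strand (an inference from the 18-nt first product, not a length stated in the text) and ideal 3-nt steps, ignoring the minor +4 and +2 registers. The function name and parameters are hypothetical.

```python
def successive_incision_products(start_len, step=3, cuts=3):
    """Return the length of the 3'-labeled strand after each successive incision.

    start_len: assumed starting length of the labeled scissile strand (nt).
    step:      nucleotides released per incision (+3 register by default).
    cuts:      number of successive incisions to model.
    """
    products = []
    length = start_len
    for _ in range(cuts):
        length -= step
        if length <= 0:
            break  # nothing left to cut
        products.append(length)
    return products


# Assumed 21-nt starting strand, three ideal +3 incisions
print(successive_incision_products(21))  # [18, 15, 12]
```

Changing `step` to 4 or 2 for a single cut would give the kind of minor products attributed above to the alternate registers.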

Successive incisions do not occur in a detectably 
processive manner (fig. S7A) but meet the 
criteria of a distributive activity. FAN1 binds to a 
3-nt gap substrate (pGap3/3'T8), which corresponds 
to the first incision product of 5'pNi/3'T8 
[Kd = 3.6 nM, compared with 1.0 nM for 5'pNi/3'T8 
(fig. S7B)], and also cleaves this substrate to 
generate products equivalent to the second and 
third products of pNi/3'T8 (Fig. 3C). To determine 
the effect of increasing gap length, we assayed a 
5'pT1/3'T8 substrate containing a long, 35-base 
pair postnick duplex. The sequential incisions 
stalled after four cuts even in a 100 nM stoichiometric 
reaction (fig. S7C), suggesting that a gap 
length beyond ~12 nts of ssDNA, which presumably 
loops out at the NTD-CTD interface, substantially 
reduces FAN1 binding efficiency. In vivo, 
the stalling could be inconsequential if FAN1 
acts at a site with a complementary 3' flap or in 
cooperation with a distributive polymerase, either 
of which can close the emergent gap and restore 
high-affinity binding. 

Fig. 3. Nuclease activity. (A) Denaturing PAGE of nuclease reactions with the indicated substrates (100 nM), FAN1, and NaCl concentrations. The 
substrate scissile strand and molecular weight standards (st.) are labeled with 3' α-32P deoxyadenosine triphosphate (asterisks). The column graphs 
show the sum of all cleaved bands divided by total DNA, as the mean ± SD (error bars) from three experiments. (B) Nuclease time course of 5'pT1/3'T8 
(left) and 5'pNi/3'T8 (right) in 150 mM NaCl. Bands and the corresponding molecular weight standards are marked. Fractional amounts of each major 
product and their total (“all”) are graphed as the mean ± SD (error bars) from three experiments. (C) Nuclease time course of pGap3/3'T8, as in (B). 

Fig. 4. ICL unhooking. (A) Cu-catalyzed azide-alkyne 
cycloaddition and the expected 1,4-isomer 
of the triazole ICL. (B) Nuclease time course of the 
indicated ICL substrates and the unmodified 
control substrate (Crl-6), all in 150 mM NaCl. The 
uncut substrates and the products of the pre-ICL 
cuts (“uncut & early products”) are not resolved 
due to the cross-linked nature of the long oligonucleotides 
(fig. S8). The identity of the unhooked 
ICL band (“unhooked”) was confirmed by LC-MS of the band eluted 
from a PAGE gel of the ICL-5 reaction (table S2). (C) Quantitation of (B), graphing the 
ratio of unhooked bands to unhooked plus uncut, corrected for the 
presence of two 32P-labeled strands in uncut bands. The Crl-6 curve shows total cuts, 
so it initially rises faster than the ICL-6 curve that does not reflect 
the unresolved cuts one and two. (D) Vertical lines mark incisions identified by PAGE 
(blue) and LC-MS (red), with line height reflecting relative product 
abundance and numbers indicating product length (nucleotides). 

The +3 exonuclease mechanism is well suited 
for unhooking an ICL through two successive 
flanking incisions. To test this, we synthesized 
a stable ICL consisting of two cytosines linked 
at their N4 positions through a triazole moiety 
(19) (Fig. 4A and fig. S8). We first assayed a triazole 
ICL with the cross-linked cytosine positioned 
6 nts after the 5' phosphorylated nick of 
a 5'pNi/3'T8 substrate (ICL-6). In a 1 nM stoichiometric 
reaction, FAN1 robustly cleaved past the 
ICL, giving rise to post-ICL products that match 
those of the unmodified control substrate (Crl-6) 
with similar rates (Fig. 4, B and C). By LC-MS 
analysis of the 64-min reaction, we confirmed 
that this is due to dual incisions 5' and 3' to the 
ICL. Our LC-MS analysis identified a major molecular 
species (37% of the LC-ultraviolet quantitation) 
(table S2) consisting of the continuous 
strand, the triazole linkage, and a 3-nt fragment 
of Cyt, Gua, and Thy (11653.995 and 11653.387 
daltons, respectively, for the observed and calculated 
molecular weight) (table S2). The ICL-6 
products (Fig. 4D) also demonstrate that FAN1 
can cleave immediately 5' to the ICL, with the 5' 
phosphodiester group of the cross-linked cytosine 
at the scissile position. 

We next addressed whether FAN1 can unhook 
the triazole ICL irrespective of its register 
relative to the 5' phosphate group. For this, 
we tested ICLs with the cross-linked cytosine 
positioned 5, 4, or 3 nts after the 5' phosphorylated 
nick (ICL-5, ICL-4, and ICL-3, respectively). 
FAN1 unhooked ICL-4 and ICL-3 at faster rates 
compared with ICL-6, consistent with this requiring 
only two incisions compared with three, 
whereas the rate of ICL-5 unhooking was intermediate 
(Fig. 4, B to D). LC-MS and denaturing 
polyacrylamide gel electrophoresis (PAGE) of 
the ICL-5 reaction also indicated that FAN1 can 
cleave immediately 3' to the ICL (Fig. 4, B to D, 
and table S2). 

Diverse ICLs can cause a wide range of DNA 
structure distortions, but these are mostly limited 
to one or two base pairs flanking the lesion 
(20). The structurally and biochemically observed 
±1-nt shifts in the scissile phosphate register, 
coupled to the paucity of FAN1-DNA contacts in 
the immediate vicinity of the scissile phosphate 
group, suggest that the FAN1 active site may have 
substantial flexibility for accommodating such 
distortions. Our DNA binding and cleaving data 
strongly suggest that FAN1 acts at sites that have 
a 5' phosphate group on a short flap or nick, possibly 
generated by an upstream nuclease. Such 
sites may also arise from the regression of longer 
5' phosphorylated flaps with the concomitant 
generation of a complementary 3' flap. If FAN1 
accesses the DNA at such a site, it can keep excising 
with 3'-flap annealing until it reaches and 
unhooks the ICL. 

NEW PRODUCTS: MICROSCOPY 


Superresolution Microscopy 

LSM 880 with Airyscan is a new confocal 
laser scanning microscope that offers 
high sensitivity, enhanced resolution in 
x, y, and z, and high image-acquisition 
speed in one system. Users achieve 
a 1.7x higher resolution in all spatial 
dimensions, 140 nm laterally and 400 
nm axially. The high sensitivity leads 
to better image quality and increased 
speed. A classic confocal microscope 
illuminates one spot on the sample to 
detect the emitted fluorescence signal. 
Out-of-focus emission light is rejected at 
a pinhole, the size of which determines 
how much of the Airy disk reaches the 
detector. Users can increase the resolution 
by making the pinhole smaller, but 
the signal-to-noise ratio drops significantly, 
since less valuable emission light 
is passing through. LSM 880 lets the 
user take full advantage of large fields of 
view and the highest speed of any linear 
scanning confocal microscope at an 
unsurpassed image quality. 

ZEISS 

For info: 800-233-2343 
www.zeiss.com/lsm880 

Superresolution Microscopy 
Software 

The Olympus FV-OSR software module 
enables users of the FluoView FV1200 
easy access to what was previously the 
domain of only specialized microscopy 
systems. The extra detail offered by 
superresolution technology helps 
researchers reveal the most subtle 
mechanisms at work within cells or tissue. 
Three-dimensional confocal laser 
scanning microscopy with the FluoView 
FV1200 achieves a vastly improved lateral 
and axial resolution compared to stand- 
ard widefield, but is still limited by the 
diffraction limit of light. In certain settings 
images can therefore suffer from poor 
contrast. This is where superresolution 
microscopy comes into play, breaking the 
diffraction limit of light and enhancing cell 
and tissue imaging studies. Drawing from 
the latest Olympus optical and digital technologies, the FluoView 
FV1200 can now be transformed into a system capable of reaching 
a resolution of up to 120 nm, with simultaneous multicolor capture. 
The proprietary technology of the FV-OSR software controls spe- 
cific hardware settings and advanced signal processing. 

Olympus 

For info: +49-402-3773-5913 
www.olympus-europa.com/microscopy 

Optical Imaging System 

The Solaris quantitative optical molecular 
imaging system is a preclinical solution 
for use in small and large animal studies 
that will help to advance drug discovery 
and translational research and to develop 
and validate molecular-guided surgical 
research protocols. The Solaris system, 
when combined with PerkinElmer’s 
suite of fluorescence molecular imaging 
probes, can bridge molecular imaging 
research techniques to support poten- 
tial clinical outcomes. By distinguishing 
healthy versus diseased tissue in real- 
time, researchers can more accurately 
and completely map tumors. The Solaris 
system is designed for ambient lighting 
conditions found in preclinical surgical 
research suites and can dynamically use 
a broad and versatile range of fluorescent 
probes, offering flexibility for translational 
researchers when targeting diseases. Its 
research applications include drug effica- 
cy, drug safety, measurement of real-time 
biological therapeutic responses, and 
advanced molecular-guided surgery ap- 
plications such as surgical tumor margin 
determination in live animal models. 
PerkinElmer 
For info: 877-754-6973 
www.perkinelmer.com 


Deep-Cooled CCD Camera 

The new Syncerity back-illuminated, 
deep-cooled CCD camera comes with 
an NIR-enhanced 2048 x 70 sensor. This 
CCD sensor is designed for companies 
requiring an affordable OEM camera for 
VIS-NIR spectroscopy applications. This 
new Syncerity camera joins the existing 
Syncerity with front-illuminated 1024 x 
256 CCD sensor for UV-VIS-NIR applications. 
Its high-resolution 14 μm pixel 
size makes it ideal for Raman instrumentation. 
Syncerity NIR offers ultralow 
etaloning and more than 40% quantum 
efficiency at 1,000 nm. A peak QE of 84% 
at 700 nm and 20% at 1,050 nm means 
the Syncerity back-illuminated NIR offers 
a broad response. It is also available 
in a back-illuminated UV-VIS optimized 
configuration. Its lifetime vacuum war- 
ranty and compact size make it ideal for 
OEM integration, particularly in Raman 
microscopes, and its flexible design allows 
the OEM-dedicated Horiba team to 
quickly adapt the camera for industrial 
applications. 

Horiba Scientific 
For info: 732-494-8660 
www.horiba.com/synceritynir 

Inverted Microscope 

The Leica DMi8 inverted microscope 
with built-in modularity is designed to 
grow and adapt to ever-changing re- 
search needs. With this new concept, 
life scientists can custom-configure an 
inverted microscope now and upgrade it 
in the future for applications ranging from 
basic imaging to advanced fluorescence 
microscopy. The Leica DMi8 is equipped 
with an additional incident illumination 
port, the Infinity Port, which facilitates the 
integration of additional light sources and 
laser systems for advanced applications. 
The closed-loop focus drive with an ac- 
curacy of 20 nm for the 12 mm travel 
range enables researchers to investigate 
large specimens with high precision. In 
combination with Leica Microsystems' 
imaging software the Leica DMi8 is an 
optimal solution for live-cell microscopy. The Leica DMi8’s Infinity 
Port is the core of the system and enables access to the infinite light 
path. It integrates light sources and lasers that are necessary for 
FRAP, photo-switching, ablation, and many state-of-the-art micros- 
copy techniques. 

Leica Microsystems 
For info: 800-248-0123 
www.leica-microsystems.com/dmi8 

The E.U.’s Recipe 
to Create Jobs: 
Innovate 

Horizon 2020, the European Union’s new funding mechanism, 
is the concrete translation of Europe 2020, a 10-year strategy 
proposed by the European Commission to advance Europe’s 
economy in a “smart, inclusive, and sustainable” way. It 
reflects the widespread sentiment that Europe has fallen 
behind the rest of the world in innovation. By supporting 
areas in which Europe already shows strong competence, 
especially at the near-market stage where crucial funding 
tends to fall short, the European Union aims to emerge as a 
world leader in the areas targeted and grow its economy at 
the same time. Along the way, it is creating new opportunities 
for scientists. By Gunjan Sinha 


This year saw the launch of the European Union’s (E.U.) 
eagerly anticipated new funding mechanism, Horizon 
2020. Between 2014 and 2020, the European Commission 
will make €78.6 billion (US$99.7 billion) available to fund 
research and innovation throughout the E.U. The budget is a 
€25 billion (US$31.7 billion) increase over the E.U.’s previous 
funding mechanism, the 7th Framework Programme (FP7), 
and takes the concept of goal-oriented research several steps 
beyond its predecessor. 

Horizon 2020 is organized into three core themes: excellent 
science, industrial leadership, and societal challenges. 
The themes continue many of those introduced under FP7, 
especially toward the end of FP7’s tenure when it shifted to a 
challenge-based thematic approach. Horizon 2020, however, 
in addition to a 60% increase in funding for basic research, 
goes wholesale on the challenge-based approach, says 
Michael Jennings, a European Commission spokesperson, 
and emphasizes developing solutions to the world’s most 
pressing environmental and societal threats to health and 
well-being. 

“There is a demand for science to be more responsive and 
to address the urgently complex societal challenges of our 
times,” says Maire Geoghegan-Quinn, European Commissioner 
for Research Innovation and Science. “As a politician 
I am evangelical about the power of science to improve and 
enrich our lives and to sustain our economy.” 

The societal challenges pillar secured 38.53% of the 
budget, the largest chunk of the three, and is further broken 
down into key themes that include health, forestry, marine 
research, efficient energy, green transport, climate, and 
environment. The focus on key themes aims to inspire more 
multidisciplinary approaches, says Jennings, and encourage 
researchers to come up with the right approach rather than 
prescribe it. Since innovation is at the core of Horizon 2020, 
it also has a goal of 20% small and medium enterprise (SME) 
participation in all funded projects and will support more 
late-stage research, such as pilot and demonstration projects. 

Implicit in funding such large research consortia are the job 
opportunities they create for early-career scientists. Large 
consortia connect researchers around the globe with each 
other and also with industry, creating a path through which 
young scientists can move on or up in their careers. 

Late-stage funding for sustainable energy 

One of the first major Horizon 2020 awards announced 
in July is a €3.7 billion (US$4.7 billion) commitment to a 
public-private partnership aiming to develop Europe into 
a biobased economy. More than 70 European companies 
in the biotech, chemical, energy, agriculture, and pulp and 
paper sectors are part of the initiative, which is coordinated 
by Copenhagen, Denmark-based Novozymes. Of the €3.7 
billion, the European Commission will contribute €975 
million (US$1.2 billion) between 2014 and 2020; the rest will 
come from industry partners. 

Novozymes is an industrial biotechnology company that 
produces and sells enzymes and microorganisms used in 
people’s everyday lives, from processed food and detergents 
to farming and other industrial processes. As the cost of 
biotechnology comes down, many chemicals are being 
replaced with biotech, says Peder Holk Nielsen, president 
and chief executive officer of Novozymes. The company now 
produces millions of unique enzyme molecules each month 
when just a few years ago it could only produce thousands, 
he explains. “Access to [the technology] is exploding,” he 
says, which in turn presents a huge opportunity to apply the 
technology toward a greener economy. 

When Fabio Grieco 
finished his Ph.D., he 
wanted to work abroad. His 
advisor suggested taking 
a fellowship position with 
Decio Eizirik, another 
NAIMIT partner at the 
University of Brussels. 

The partnership’s primary focus is to provide a renewable 
alternative to the fossil-based fuels that power automobiles 
and are components of industrially produced chemicals and 
materials such as fabrics and plastics. The company and its 
partners have already built a biomass refinery in Crescentino, 
Italy, that produces 75 million liters a year of cellulosic ethanol 
from agricultural waste using the company’s enzymes. The 
majority of the €3.7 billion will be spent on building at least 
five additional production plants across 
Europe by 2020. “These are essential to show the technical 
and commercial viability of new generations of biobased 
products and fuels at an industrial scale and to reduce 
perceived investment risks,” says Holk Nielsen. 

Each plant is expected to generate between 12,000 and 
18,000 feedstock supply chain jobs annually, according to 
a report by National Non-Food Crops Centre, bioeconomy 
consultants in York, England. The boom in industrial 
biotechnology is also creating jobs for scientists. Novozymes, 
for example, is “hiring a lot of people globally,” says Peter 
Tolstoy, a people and organization partner responsible 
for hiring scientists at the company, which has 36 global 
locations and employs some 1,300 scientists. 

The Bio-Based Industries (BBI) Consortium is exemplary of 
the kind of late-stage innovation activities that Horizon 2020 
is aiming to fund, not only because of its potential to grow the 
economy and create jobs, but also because such consortia 
build relationships between industry and academia that foster 
innovation. Another key change from FP7 is 5% more funding 
to support SMEs that typically take university research and 
turn it into marketable goods. Twenty percent of the budget 
available for research on societal challenges and leading 
technologies, almost €9 billion (US$11.4 billion) in grants 
over seven years, will be available to SMEs. Of this amount, 
€3 billion (US$3.8 billion) will be provided through a dedicated 
“SME instrument,” a specific line of calls, to fund feasibility 
studies and demonstration projects. Part of the E.U. research 
budget will also be used to back public and private loans to 
SMEs. 

“The €9 billion should be seen as a minimum,” says 
Geoghegan-Quinn. “SMEs are the backbone of the European 
economy and account for two thirds of total employment.” 

Another consortium with large SME participation working 
toward greening the economy is LEANWIND (Logistic 
Efficiencies And Naval architecture for Wind Installations with 
Novel Developments) headed by Jimmy Murphy, lecturer 
at Beaufort Research, University College Cork, Ireland. 
Securing one of the last awards under FP7, LEANWIND is a 
consortium of 31 partners from 11 countries, half of which 
are small companies working toward making wind energy 
competitive with fossil fuel energy sources. LEANWIND will 
receive €10 million (US$12.7 million) from the European 
Commission between 2014 and 2017 to find ways to 
streamline the installation, management, and maintenance 
of offshore wind farms. Industry partners will contribute an 
additional €5 million (US$6.3 million). To cut costs, “there is 
an entire logistical chain of events that can be optimized,” 
says Murphy. 

For early-career scientists, collaborative research projects 
that include industry are invaluable for the contacts they 
provide. Katie Lynch, for example, landed her job as a 
LEANWIND work package leader after collaborating with 
Murphy on a related FP7-funded project. While working 
on that project, the 30-year-old Lynch, who has a Master’s 
degree in renewable energy, was asked to help put together 
the FP7 proposal for LEANWIND, with the intention that she 
would work on the project if it were funded. The experience 
of writing the FP7 proposal and of working in her current 
position managing finance and market assessments has 
given her precious experience in leadership, she says, as well 
as put her in touch with industry leaders. “You have to put 
together a consortium of people with whom you would like to 
work with over and over again and maintain those contacts,” 
she says. 

Such consortia also build long-term connections between 
academics and industry, boosting job prospects. “It would be 
wonderful if it was natural for academics to move to industry 
and vice versa,” says Geoghegan-Quinn. “In the U.S. this 
happens all the time. We are setting up the architecture to 
allow that to happen [here in Europe].” 

Basic research not neglected 

Even though the societal challenges pillar of Horizon 2020 
is goal oriented, basic research can be funded if scientists 
can show practical relevance. Since Horizon 2020 just 
launched, there are several projects funded under FP7 that 
the European Commission considers exemplary of the type of 
life science research Horizon 2020 aims to fund. 

The Natural Immunomodulators as Novel Immunotherapies 
for Type I diabetes (NAIMIT) consortium, for example, 
investigated a controversial theory about diabetes type 1 
to come up with new therapies. Composed of 14 academic 
partners and two biotech companies, the group was 
brought together by Chantal Mathieu, head of clinical and 
experimental endocrinology at the University of Leuven, 
Belgium. Pancreatic beta cells aren’t just “sitting ducks” in 
the pathogenesis of diabetes, Mathieu explains. “There’s an 
active dialogue between beta cells and the immune system 
that mediates beta cell death.” Mathieu brought in partners 
whom she thought could illuminate this process and develop 
therapeutic approaches based on it. 

It is through NAIMIT that Fabio Grieco landed his current 
job as a postdoctoral fellow at the Universite Libre de Brux- 
elles (ULB), Belgium, Laboratory of Experimental Medicine, 
and ULB Center for Diabetes Research. In 2010, Grieco was 
working on his Ph.D. at the University of Siena, Italy when 
his supervisor, Francesco Dotta, began collaborating with 
Mathieu. When Grieco finished his Ph.D., he wanted to work 
abroad. His advisor suggested taking a fellowship position 
with Decio Eizirik, another NAIMIT partner at the University 
of Brussels. He has been at the center for diabetes research 
since 2012. Although his fellowship time is up, he has the op- 
portunity to continue his research at the center. 

European Research Council 

Established by the European Commission 

Got a golden idea? 
We’ll fund it. 

The European Research Council (ERC) offers five-year grants of up to $4.4 million. 

■ open to top researchers from anywhere in the world 

■ for realising ambitious projects - at least half the time spent in Europe 

■ in any field: physical sciences & engineering, life sciences, social sciences & humanities 

Want to find out more? 

Visit erc.europa.eu and apply for Starting Grants (deadline: 3 Feb 2015) 
or Consolidator Grants (deadline: 12 Mar 2015) 

Horizon 2020 
European Union funding 
for Research & Innovation 

When Sebastian 
Rauschert 
saw that the University 
of Munich offered a 
Ph.D. program in early 
nutrition through The 
Early Nutrition Project, 
he jumped at the 
chance. 

Featured Participants 

Beaufort Research, 
University College Cork, 
Ireland 
www.ucc.ie/beaufort 

Early Nutrition Project 
www.project-earlynutrition.eu 

European Commissioner 
for Research Innovation 
and Science 
ec.europa.eu/commission_2010-2014/geoghegan-quinn/index_en.htm 

European Research 
Council 
erc.europa.eu 

LEANWIND Consortium 
www.leanwind.eu 

NAIMIT Consortium 
www.naimit.eu 

Novozymes 
www.novozymes.com 

University of Leuven 
www.kuleuven.be/english 

Universite Libre de 
Bruxelles 
www.ulb.ac.be 

The NAIMIT consortium was funded under 
FP7 with a total budget of €14.25 million (US$18.08 
million), of which the European Union contributed €10.92 
million (US$13.8 million). The rest came from participating 
universities or individual research grants. E.U. funding 
under FP7 ends this year. Two of the projects, however, 
have entered the clinic, says Mathieu, and research on a 
bacterial system to deliver immune modulating compounds 
systemically described in the group’s 2012 paper “is 
extremely promising,” she adds. 

Another exemplary health project funded under FP7 is 
The Early Nutrition Project, which includes researchers from 
36 institutions in 12 countries across Europe, the United 
States, and Australia. The project is being coordinated out 
of Ludwig-Maximilians-University in Munich, Germany 
and aims to understand how early nutrition programming 
and lifestyle factors impact the rates of obesity and related 
disorders. Research projects include studying how maternal 
weight during pregnancy affects the long-term health of 
babies, how the placenta regulates fetal growth, and how 
early nutrition affects children’s long-term risk of metabolic 
diseases. The project is funded with €8.96 million (US$11.37 
million) from the European Union between 2012 and 2016, 
with a total budget of €11.12 million (US$14.1 million). 

Such large research consortia open up opportunities for 
young people in other ways too. Sebastian Rauschert studied 
sociology and psychology as an undergraduate. He then 
earned a Master’s degree in public health. He has “always 
been interested in disease prevention and nutrition,” he says. 
When he saw that the University of Munich offered a Ph.D. 
program in early nutrition through The Early Nutrition Project, 
he jumped at the chance. As part of his Ph.D. thesis, he is 
studying whether breastfeeding influences obesity-related 
metabolic disorders, such as diabetes, by analyzing data 
from the Western Australian Pregnancy Cohort Study, which 
has followed over 1,000 babies since 1989. This month 
Rauschert will travel to Perth to spend six months working 
directly with his collaborators. The trip is funded through the 
brain mobility project, established under the Early Nutrition 
Project to enable young scientists working on the project the 
opportunity to travel abroad and work with collaborators, and 
incidentally give them a networking boost. 

E.U. grants carry cachet 

For early-career scientists, landing a job with E.U.-funded 
consortia isn’t the only stepping-stone available. They can 
apply for individual European Research Council (ERC) grants, 
whose budget of €13.1 billion (US$16.6 billion) represents 
a 60% increase over FP7, or for a Marie Skłodowska-Curie 
actions (MSCA) grant. Both are included under the 
Excellent Science pillar. MSCA grants are awarded in all 
scientific disciplines at all stages of researchers’ careers, from 
Ph.D. candidates to highly experienced researchers. 

For those who beat out the competition to win an ERC 
grant, the job prospects become even brighter. “It’s imme- 
diately considered as positive recognition of the quality and 
ambition of the scientist,” says Jean-Pierre Bourguignon, 
president of the ERC. “There is competition now among insti- 
tutions to secure these people.” 

It also can make scientists more attractive to industry, says 
Bourguignon. One Swedish ERC grant winner, for example, 
was able to garner funding for two startups based on the 
credibility the grants gave him, says Bourguignon. “He initially
didn’t think he was a candidate. His colleagues pushed him
to apply.”

For Horizon 2020 applicants, proposals that take a mul- 
tidisciplinary approach are more likely to be successful. 

“Be prepared to step outside of your comfort zone,” says 
Geoghegan-Quinn. “We are asking scientists to be bold and 
brave and not stay pigeonholed in the same area.”

Horizon 2020 boasts other improvements over FP7 that 
include a move to online application and review, shorter 
review times, and streamlined auditing — measures that not 
only modernize the process but also address major criticisms 
of its predecessor. On measuring Horizon 2020’s success, 
Geoghegan-Quinn says: “I am hoping the money we are
investing in these societal challenges will not only make my 
life better but also give hope to young people.” 


Gunjan Sinha is a freelance writer living in Berlin, Germany. 
DOI: 10.1126/science.opms.r1400150
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Looking for a PhD position? 

Join the IST Austria Grad School!















PhD Program 

• PhD-granting institution devoted to basic research in natural sciences and math

• 4-5 year study

• BS or MS degree (or equivalent) required

• internationally competitive PhD salary

www.ist.ac.at 




IST AUSTRIA


Institute of Science and Technology 


Deadline for applications 

for students wishing to enter the 
program in the fall of 2015 is 

January 15, 2015



IST Austria is committed to Equality and Diversity.


PRIZES 


Foundation 
for Polish Science 


The Foundation for Polish Science has for the 23rd time granted its Prizes, 
considered to be Poland’s most important scientific awards. 


The Foundation for Polish Science Prize 2014 Laureates 






PROF. TOMASZ GOSLAR from 
Adam Mickiewicz University in 
Poznan received the FNP Prize 
2014 in the life and earth scienc- 
es for his contribution to deter- 
mining the chronology of changes 
in C14 carbon isotope concentra- 
tion in the atmosphere during the 
last ice age, which is of key impor- 
tance for contemporary climate 
research. 


PROF. KAROL GRELA from the 
University of Warsaw and the 
Institute of Organic Chemistry of 
the Polish Academy of Sciences in 
Warsaw received the FNP Prize 
2014 in the chemical and mate- 
rials sciences for developing new 
catalysts for olefin metathesis
reactions and applying them in 
industrial practice. 


PROF. IWO BIAŁYNICKI-BIRULA
from the Centre for Theoretical
Physics of the Polish Academy of 
Sciences received the FNP Prize 
2014 in the mathematical, physi- 
cal and engineering sciences for 
fundamental studies of the electromagnetic
field that have led to
the formulation of the uncertainty
relations for photons.


PROF. LECH SZCZUCKI from
the Institute of Philosophy and 
Sociology of the Polish Academy 
of Sciences received the FNP Prize 
2014 in the humanities and so- 
cial sciences for explaining the 
cultural ties between Central and 
Western Europe in a monumental 
edition of the correspondence of 
Andrzej Dudycz, the 16th-century 
thinker, religious reformer and 
diplomat.


The Foundation for Polish Science has been in operation since 1991. It is a non-governmental, non-political, non-profit institution which
pursues the mission of supporting science. It is the largest non-profit organization in Poland dedicated solely to the development of science.

www.fnp.org.pl 
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Institut Pasteur 


Group Leader Positions in 

Quantitative Biology and Modeling 


The Institut Pasteur (Paris, France) announces an international 
call for group leader positions in the field of quantitative biology
and modeling in developmental systems. Successful applicants 
will be integrated into the cutting edge interdisciplinary 
environment offered by an internationally renowned institute 
combining fundamental and translational research, in an 
attractive location in central Paris, in close proximity to other 
major research centers. Candidates with both an experimental 
and theoretical background, using quantitative approaches 
and willing to develop multidisciplinary projects related to 
developmental processes are encouraged to apply. 

Successful junior candidates [1] will be appointed with a 
permanent position, and as head of a group of 6 people. 
These groups will be created for a period of 5 years and may 
thereafter compete for a full research group. 

Successful mid-career and senior candidates will be appointed 
with a permanent position, and as head of a research group of 
8 to 12 people. The groups will be created for 10 years (mid- 
term evaluation at 5 years) with the possibility of renewal. 
Candidates should send their formal applications by e-mail to
the Director of Scientific Evaluation, Prof. Alain Israel, at the
Institut Pasteur (qubimo@pasteur.fr).

Application deadline: February 6, 2015 
Short-listed candidates will be contacted for interview. 


Applicants should provide the following (in order) in a single 
pdf file : 

1. A brief introductory letter of motivation, including the name
of the proposed group. Candidates are encouraged to contact 
the head of the Search Committee Francois Schweisguth 
(fschweis@pasteur.fr) or the head of the Department of 
Developmental & Stem Cell Biology Shahragim Tajbakhsh 
for queries (shaht@pasteur.fr). 

2. A Curriculum Vitae and a full publication list. 

3. A description of past and present research activities (up to 6 
pages with 1.5 spacing; Times 11 or Arial 10 font size). 

4. The proposed research project (up to 6 pages with 1.5 spacing;
Times 11 or Arial 10 font size). 

Junior candidates [1] should also provide: 

5. The names of 3 scientists from whom letters of 
recommendation can be sought, together with the names 
of scientists with a potential conflict of interest from whom 
evaluations should not be requested. 

[1] Institut Pasteur is an equal opportunity employer. Junior 
group leaders should be less than 8 years after PhD at the time 
of submission (Dec 31, 2014). Women are eligible up to 11 years 
after their PhD if they have one child and up to 14 years after 
their PhD if they have two or more children. 
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The Universidade NOVA de Lisboa invites applications for 
a full time position as 

Director 

of the Instituto de Higiene e Medicina Tropical

The position of Director of the Institute of Hygiene and Tropical Medicine 
(IHMT), Universidade NOVA de Lisboa, is open for applications. 

The Director must have a doctorate degree (Ph.D.) or legal equivalent, scientific 
seniority in areas of interest to the mission of the IHMT (see http://www.ihmt.unl.pt)
and management experience. A good knowledge of Portuguese
and English is required. 

Candidates should demonstrate initiative and strategic vision, as well as 
capacity to implement the statutory mission of the IHMT, promoting national 
and international contacts. Successful candidates should guarantee proper 
functioning and good management of the various administrative and scientific 
units of the IHMT, also committing themselves to seek appropriate funding 
sources to strengthen IHMT strategic action in the area of Health and Life 
Sciences. 

Applications, submitted electronically before 12 pm on December 22, 2014, 
should include the following: 

1. application letter, addressed to the Chairman of the Electoral 
Commission; 

2. brief Curriculum vitae, with clear information on relevant professional 
experience; 

3. a plan of action (max. 3 pages) within the operating area of the 
IHMT, demonstrating strategic planning and management capacity in 
higher education, research, internationalization and cooperation for 
development. 

All documents should be sent to eleicaodiretor@ihmt.unl.pt along with an 
electronic confirmation of receipt. 

Salary will be determined in accordance with Portuguese law, corresponding,
as an indicative value, to a maximum monthly wage of 5,846.41 €, before 
taxes. 

Selected applicants will be interviewed at IHMT, in Lisbon, on January 26, 
2015. 




Postdoc TGF-β


Leiden University Medical Center (LUMC) is committed to 
ongoing improvement of health care quality, invests in health- 
care related research, and intends to consolidate its leading 
international role in this area. 

■ The Department of Molecular Cell Biology has two Postdoctoral 
positions available in a dynamic, international and interdisciplinary 
research team to study the molecular basis of subverted TGF-β
signal transduction, one in cancer and one in cardiovascular biology. 

■ Informal enquiries can be directed to Prof Peter ten Dijke,
e-mail p.ten_dijke@lumc.nl, telephone +31 (0)71 526 92 71. 

■ To apply, send your letter of application and CV, a description 
of research accomplishments, and contact information of two 
references to PenO-div4@lumc.nl. Please quote job reference
number D.14.VC.MP.18/19/SC48.



www.lumc.nl 









Ludwig-Maximilians-Universität (LMU) in Munich is one of the leading research universities in Europe, with a
more than 500-year-long tradition. LMU aims at creating favorable conditions for excellent junior academics. To 
provide gifted early-career researchers in all subject areas with a long-term career perspective, LMU is offering 

Tenure Track Professorships to ERC Starting Grantees 

If you intend to submit a proposal to the current call
of the European Research Council and would be interested
in joining LMU's research community, please
contact the appropriate faculty member in your field
of research as soon as possible.

For further information regarding this offer, please see:
www.lmu.de/excellent/erc-tenuretrack
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GRADUATE SCHOOL OF 

Quantitative 

Biosciences 

Munich 






Do you wish to ... 

• earn a PhD, building on your training in biochemistry,
biology, physics, or applied maths?

• conduct cutting-edge research at the interface of
experiment and quantitative theory?

• learn how to communicate and work with scientists
from different fields?

Then join us at QBM! 

The newly established Graduate School of Quantitative Biosciences Munich (QBM) 
is funded by the German Excellence Initiative and seeks to prepare young life scientists 
for the emerging era of quantitative, systems-oriented bioscience. It provides an 
innovative, international PhD training program that bridges the divide between 
traditionally separate disciplines, from biochemistry and medicine to bioinformatics, 
experimental and theoretical biophysics, and applied mathematics. 

Key elements of the program are an interdisciplinary research project jointly supervised 
by two PIs from different fields, and an educational curriculum centered around an
intensive core course that integrates a wide range of approaches to biological
problems. A multifaceted mentoring and professional skills program supports the
students’ growth as independent scientists. 

For more information, visit us at www.qbm.lmu.de
Application deadline: January 5th, 2015 
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Join Germany's
international 
research 
community! 



Discover the opportunities that 
Germany provides: 

• First-class doctoral training

• Excellent career opportunities
in research and industry

• Attractive funding programmes
for international researchers


For the latest research news, 
job offers and more go to: 

www.research-in-germany.de 


Find us on Facebook:
facebook.com/Research.in.Germany



Follow us on Twitter: 
twitter.com/ResearchGermany


AN INITIATIVE OF THE 

Federal Ministry
of Education 
and Research 


Research in 
Germany 

Land of Ideas 

www.research-in-germany.de


Uppsala University is recruiting a 

Senior Lecturer in Molecular Detection

at the Department of Immunology, Genetics and Pathology, Molecular Tools unit. Research 
in the Molecular Tools unit is focused on conceiving and developing advanced molecular 
analysis techniques with applications in medicine and biology, and to make these technologies 
broadly available. The work follows a strong local tradition for molecular tools development, 
reflected in an active biotech industry in Uppsala and Stockholm. Unlike the situation in
most countries, the so-called teacher's exemption in Sweden grants university employees
full rights to their own inventions, allowing scientists to engage in translating their findings 
for industrial applications. Technologies previously developed by the Molecular Tools unit 
at Uppsala University have been licensed to eleven major international biotechnology or 
diagnostics companies, and their technologies have also formed the basis of five start-up 
companies spun out from the lab. 

The Senior Lecturer position includes research, teaching, and administration. The Senior 
Lecturer is expected to conduct and publish internationally competitive research within the 
molecular detection area, and to attract external funding for this research. 

The successful candidate for this post should display research experience in the area of 
innovative molecular detection and analysis technology, and expertise that complements work 
in the Molecular Tools research program. Examples of desired expertise include the ability to 
develop new important techniques for molecular analysis and to apply them in large-scale 
biomedical contexts, together with deep insights into molecular technological needs and 
requirements in biomedical research and clinical diagnostics. 

Information about the position can be given by Professor Ulf Landegren, +46 18-471 49 10,
ulf.landegren@igp.uu.se.

Deadline for the application is December 21st, 2014.

For full advertisement and link to the application form go to 

http://www.uu.se/en/join-us/jobs/ 

Reference number UFV-PA 2014/3179 
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ENABLING TODAY'S 
SCIENTIFIC LEADERS 


International call for applications for two 
exceptional grant awards within the areas 
of biomedicine and biotechnology 

Novo Nordisk Foundation Laureate Research Grants are for 
outstanding established scientists to come to Denmark to 
strengthen their groundbreaking research programs. 

Applicants must have an independent research program that 
they have directed for 7 or more years in total. 

Grant funding 

Up to 40 million Danish kroner over 7 years 
(EUR ~5.4 million, USD ~6.7 million). 

NNF Laureate Research Grant holders can apply for continued 
NNF LRG funding, up to DKK 35 million over 7 additional years. 

Application deadline: February 10, 2015
Further information 

To learn more about these grants, eligibility and the application 
process, please visit: 

www.novonordiskfonden.dk/en 

novo nordisk fonden 



SUPPORTING TOMORROW'S 
LEADING SCIENTISTS 


International call for applications for two 
exceptional grant awards within the areas 
of biomedicine and biotechnology 

Novo Nordisk Foundation Young Investigator Awards are for 
outstanding younger scientists to come to Denmark to 
expand their groundbreaking research programs. 

Applicants must have an independent research program that 
they have directed for less than 7 years in total. 

Grant funding 

■ Up to 20 million Danish kroner over 7 years 
(EUR ~2.7 million, USD ~3.4 million).

■ NNF Young Investigator Award holders can apply for additional 
funding from other eligible Novo Nordisk Foundation programs. 

Application deadline: February 10, 2015 

Further information 

To learn more about these grants, eligibility and the application 
process, please visit: 

www.novonordiskfonden.dk/en 

novo nordisk fonden 
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Research Opportunities 
in Luxembourg. 

See what's behind it. 

PEARL

LUXEMBOURG'S RESEARCH PROGRAMME FOR INTERNATIONALLY 
RECOGNISED SENIOR RESEARCHERS 

Interested in establishing a high-profile research programme? 
Through our research programme PEARL (financial 
contribution up to EUR 5 million) we give you the opportunity to 
transfer your research programme to a research
institution in Luxembourg. 


ATTRACT

LUXEMBOURG'S RESEARCH PROGRAMME FOR OUTSTANDING
YOUNG RESEARCHERS FROM ALL OVER THE WORLD 

Interested in doing scientific research at a high level in an 
international environment? Our research programme ATTRACT 
will allow you to set up your independent research team within 
a research institution in Luxembourg which will offer you 
attractive career opportunities. Funding up to EUR 2 million. 


More information about ATTRACT and PEARL as well as 

the other funding opportunities offered by the National Research 

Fund Luxembourg can be found on the FNR’s website. 

Go and see what’s behind on www.fnr.lu/pearl and 
www.fnr.lu/attract 

For an overview on research in Luxembourg, 
have a look at www.innovation.public.lu


Fonds National de la

Recherche Luxembourg 




advancing the frontiers


Always craved 


for knowledge 


In 2015, 

CNRS 

is recruiting 

researchers 


Online registration at www.cnrs.fr

from December 1, 2014 to January 6, 2015


Disabled candidates can also
be recruited by contractual 
agreement 


In all scientific fields 


• Life sciences
• Physics
• Nuclear and Particle Physics
• Chemistry
• Mathematics
• Information technologies
• Earth sciences and Astronomy
• Humanities and Social sciences
• Environmental sciences and Sustainable development
• Engineering


INVESTIGATING FUTURE CHALLENGES 
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ARIZONA STATE UNIVERSITY 

Assistant/Associate Professor (JOB# 11032) 

The School of Life Sciences at Arizona State University invites applications for a tenure-track faculty 
position in Macrosystems Biology, at the level of either Assistant or Associate Professor. Rank and 
tenure status will be commensurate with experience. Anticipated start date is August 16, 2015. 

Applicants must have a PhD or equivalent by the time of appointment; expertise in ecology, 
environmental science, or a related discipline; computational skills for handling large sets of data; 
and a record of or potential to conduct cutting-edge research to address problems spanning multiple 
spatial scales (e.g., global change, biological invasions, or disease control). Desired qualifications 
include: (1) experience building or working with interdisciplinary teams, (2) a strong record of 
scholarly achievement, (3) extramural funding from diverse sources, (4) experience with teaching 
and mentoring at a university, (5) two years of postdoctoral training, (6) a record of research that
combines theoretical modeling and empirical data, and (7) potential to complement existing research in
the School of Life Sciences and in other centers throughout the university. Examples of
relevant centers include the Global Institute of Sustainability, the Center for Biodiversity Outcomes,
the Center for Environmental Science Applications, and the Complex Adaptive Systems Initiative.

Arizona State University is a dynamic, progressive university dedicated to interdisciplinary 
collaboration, to student-centered education, and to reshaping the relationship between academia 
and society. The successful candidate will interact with interdisciplinary research teams within the 
university, teach at the undergraduate and graduate levels, and serve the school, the university, and 
its surrounding communities. 

To apply, please submit: (1) a cover letter addressed to Michael Angilletta, Chair, Macrosystems 
Biology Search Committee, (2) curriculum vitae, (3) three representative publications, (4) contact 
information for three references, (5) a statement of research accomplishments and goals (emphasizing
collaborative interdisciplinary research), and (6) a statement of teaching philosophy and experience. 
Application materials must be sent electronically in a single pdf file to solsfacultysearch4@asu.edu. 
The initial closing date for receipt of applications is January 2, 2015; applications will be reviewed 
weekly thereafter until the search is closed. A background check is required for employment. 

Arizona State University is a VEVRAA Federal Contractor and an Equal Opportunity/Affirmative 
Action Employer. All qualified applications will be considered without regard to race, color, sex, 
religion, national origin, disability, protected veteran status or any other basis protected by law. 
https://www.asu.edu/aad/manuals/acd/acd401.html https://www.asu.edu/titleIX/



AAAS is here - 

promoting universal science literacy. 

In 1985, AAAS founded Project 2061 with 
the goal of helping all Americans become 
literate in science, mathematics, and technology.
With its landmark publications
Science for All Americans and Benchmarks 
for Science Literacy, Project 2061 set out 
recommendations for what all students 
should know and be able to do in science, 
mathematics, and technology by the time 
they graduate from high school. 

As a AAAS member, your dues help 
support Project 2061. If you are not yet a 
member, join us. Together we can make a 
difference. 

To learn more, visit 
aaas.org/plusyou/project2061

AAAS + U = Δ


UC DAVIS

Assistant Professor of Quantitative Wildlife Ecology
Department of Wildlife, Fish, and
Conservation Biology

The University of California, Davis is recruiting a Quantitative Wildlife 
Ecologist at the tenure-track ASSISTANT PROFESSOR level, with
academic year (9-month) term employment. Candidates must have the ability
to develop a vigorous, extramurally-funded research program that addresses 
quantitative questions relevant to the ecology of wild vertebrates, including 
those in California and beyond, and to teach courses in quantitative wildlife/ 
fish ecology and estimation and share responsibility in other departmental 
teaching needs. We seek candidates with a demonstrated publication record 
in the development or application of tools and methods to estimate or analyze 
wildlife demography. Qualifications include Ph.D. in relevant discipline, and 
evidence of potential for accomplishment in research, teaching, and service. 
To obtain additional information and apply for the position please use the 
following website: https://recruit.ucdavis.edu/apply/JPF00442 . Additional 
inquiries can be directed to Associate Professor Brian D. Todd, Recruitment 
Advisory Committee Chair, Department of Wildlife, Fish, & Conservation 
Biology, One Shields Ave., University of California, Davis, CA 95616, 
Tel (530) 754-1140, FAX (530) 752-4154, email: btodd@ucdavis.edu. The 
position will remain open until filled but to ensure consideration, applications 
should be received by January 9, 2015. 

THE COLLEGE OF AGRICULTURAL AND ENVIRONMENTAL 
SCIENCES AND DEPARTMENT OF WILDLIFE, FISH and 
CONSERVATION BIOLOGY AT UC DAVIS ARE COMMITTED TO 
BUILDING A DIVERSE FACULTY, STAFF AND STUDENT BODY 
REFLECTING THE POPULATION AND EDUCATIONAL NEEDS OF 
CALIFORNIA AND THE NATION. 

UC Davis is an Affirmative Action/Equal Employment Opportunity 
Employer and is dedicated to recruiting a diverse faculty community. We 
welcome all qualified applicants to apply, including women, minorities, 
veterans, and individuals with disabilities. 
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BIOLOGY CHAIR SEARCH 


Georgia State University


The Department of Biology at Georgia State 
University invites applications and nominations
for a tenure-eligible Professor to fill a 12-month 
position of Department Chair preferably to begin 
July, 2015. The successful candidate must have a 
Ph.D. or equivalent degree, a distinguished record 
in biological or biomedical research that includes a history of significant 
external support, demonstrated leadership skills, and a strong commitment 
to graduate and undergraduate training and education. The Department Chair 
reports to the Dean of the College of Arts and Sciences and will provide 
visionary direction for collaborations and partnerships with other Departments, 
Schools, Centers, and Institutes of the University in accordance with its five-year
University Strategic Plan (http://www.gsu.edu/about/governance-and-strategy/).


The Department of Biology consists of 55 full-time faculty with diverse 
research interests and a strong commitment to undergraduate education. The 
department offers Ph.D. degree programs with concentrations in Applied & 
Environmental Microbiology, Cellular and Molecular Biology and Physiology, 
Molecular Genetics and Biochemistry, and Neurobiology & Behavior as 
well as a strong M.S. degree program. The departmental research programs 
are supported by state-of-the-art core facilities. More information on the 
department can be found at www.biology.gsu.edu. 


Founded in 1913, Georgia State University is a leading urban research 
university located in the heart of downtown Atlanta, a growing, dynamic, 
diverse, and global metropolis. This diversity is reflected in its student 
population of more than 32,000, including more than 24,000 undergraduates, 
with additional growth anticipated. 


Applicants should arrange to have 3 reference letters as well as curriculum vitae, 
letter of intent, and samples of scholarly work and leadership effectiveness 
sent to biologychairsearch@gsu.edu. Applications from underrepresented 
minority and female candidates are enthusiastically encouraged. Application 
review will begin December 1, 2014, and the position will remain open until 
filled. Employment is contingent on background verification. 


Georgia State University, a part of the University System of Georgia, is an 
Equal Opportunity Educational Institution and an EEO/AA Employer. 




AAAS is here - 

helping scientists 
achieve career success. 

Every month, over 400,000 
students and scientists visit 
ScienceCareers.org in search of 
the information, advice, and 
opportunities they need to take 
the next step in their careers. 

A complete career resource, free
to the public, Science Careers
offers a suite of tools and services 
developed specifically for scientists. 
With hundreds of career develop- 
ment articles, webinars and 
downloadable booklets filled with 
practical advice, a community 
forum providing answers to career 
questions, and thousands of job 
listings in academia, government, 
and industry, Science Careers has
helped countless individuals 
prepare themselves for successful 
careers. 

As a AAAS member, your dues help 
AAAS make this service freely avail- 
able to the scientific community. 

If you’re not a member, join us. 
Together we can make a difference. 


To learn more, visit 
aaas.org/plusyou/sciencecareers 
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Rutgers 

THE STATE UNIVERSITY 

OF NEW JERSEY 

The Duncan and Nancy MacMillan Chair in Cancer Genomics 

Rutgers, The State University of New Jersey, has a robust new program in Cancer Genetics, initially 
funded through philanthropy, in collaboration with the Department of Genetics, the Human Genetics 
Institute of New Jersey and the Rutgers Cancer Institute of New Jersey. The Department of Genetics 
seeks to appoint an accomplished senior scientist with clinical interests who pursues discoveries on 
the genetic bases of cancers and the application of novel information to diagnostics and/or treatments. 
The Candidate will also be appointed as a Full Member of the Cancer Institute of New Jersey, one of 
only 41 NCI-designated Comprehensive Cancer Centers. A secondary appointment to an appropriate
clinical department is possible. The Chair will help structure an endowment-supported curriculum in
Cancer Genomics. The Chair is accompanied by significant new and existing resources and additional 
faculty appointments in related areas. Appointees shall have appropriate access to the advanced 
genomic technologies and computational resources of RUCDR Infinite Biologics®, a world leader
in sample acquisition, processing and genomic analysis. Suitable recently-built laboratory space and 
computational resources are available. 

The Department of Genetics, located on the Rutgers Piscataway campus, is home to over 30 faculty 
with diverse interests and numerous well-funded research programs, and is part of a vibrant life 
sciences and computational community. The Cancer Institute of New Jersey, with its extensive 
research and clinical programs, is on the immediately adjacent Rutgers New Brunswick campus. 
Both are located in suburban central New Jersey, close to New York City, Philadelphia, beaches, 
and countryside. For more information on the Department and Rutgers see:
http://genetics.rutgers.edu/faculty/faculty-recruitment.

Candidates must have an M.D. and/or a Ph.D. or the equivalent in Genetics or a related field. They 
must have a successful history of leadership, discovery, and publication as demonstrated by consistent,
current and significant grant funding and be eligible for appointment at the Professor or Distinguished
Professor rank. Applicants should submit a CV, a detailed statement of research interests, a teaching
statement, and full contact information for three individuals willing to provide letters of reference.
Applications should be submitted electronically at http://apply.interfolio.com/27239 and inquiries
made to Ms. Mary Carmona, carmona@dls.rutgers.edu. Review of applications will begin 
November 15, 2014 and continue until the position is filled. 


Faculty Position in Computational Genetics 

The Department of Genetics in the School of Arts and Sciences at Rutgers, The State University 
of New Jersey seeks an outstanding scientist to complement the existing faculty in computational 
genetics, moving our program into exciting new areas and expanding our existing strengths. Tenure- 
track or tenured appointment will be made at the Assistant, Associate, or Full Professor level. Areas of 
interest include, but are not limited to, population or evolutionary genetics, bioinformatics, statistical
genetics, computational genomics, and analysis of complex genetic diseases. Experienced faculty with 
appropriate experience will be considered for a leadership role within our Computational Genetics 
Group. Department of Genetics faculty may also join the Human Genetics Institute of New Jersey. 
Core resources and generous startup funds will be provided. Research space, including wet lab if 
needed, will be provided in the newly constructed Life Sciences Building. 

The Department of Genetics is home to over 30 faculty with diverse interests and numerous well- 
funded research programs, hosts one of the oldest undergraduate majors in Genetics in the country, 
and is part of a vibrant life sciences and computational community. Our computational group 
collaborates with other Department of Genetics faculty and Rutgers scientists within the Division of 
Life Sciences, the Departments of Computer Science and Statistics, the Waksman Institute, the Center 
for Advanced Biotechnology and Medicine, the Robert Wood Johnson Medical School, the BioMaPS 
Institute for Quantitative Biology, the Center for Discrete Mathematics and Theoretical Computer 
Science (DIMACS), the Center for Human Evolutionary Studies, and the Cancer Institute of New 
Jersey. The New Brunswick/Piscataway campus is located in suburban central New Jersey, close to 
New York City, Philadelphia, beaches, and countryside. For more information on the Department 
and Rutgers see: http://genetics.rutgers.edu/faculty/faculty-recruitment.

Candidates must have an M.D. and/or a Ph.D. or the equivalent in Genetics, Computer Sciences, 
or a related field, demonstrated record of significant research, the potential to make substantial 
contributions as an independent investigator, and have a commitment to teaching undergraduate 
and graduate students. Applicants should submit a CV, a detailed statement of research interests, a 
teaching statement, and full contact information for three individuals willing to provide letters of 
reference. Applications should be submitted electronically at http://apply.interfolio.com/22156 and 
inquiries made to Ms. Mary Carmona, carmona@dls.rutgers.edu. Review of applications will 
begin November 1, 2014 and continue until the position is filled. 

Rutgers, the State University of New Jersey, is an Equal Opportunity / Affirmative Action 
Employer. Qualified applicants will be considered for employment without regard to race, creed, 
color, religion, sex, sexual orientation, gender identity or expression, national origin, disability 
status, genetic information, protected veteran status, military service or any other category 
protected by law. As an institution, we value diversity of background and opinion, and prohibit 
discrimination or harassment on the basis of any legally protected class in the areas of hiring, 
recruitment, promotion, transfer, demotion, training, compensation, pay, fringe benefits, layoff, 
termination or any other terms and conditions of employment. 





Learn more and bring your job search 
in for a smooth landing. 


• Search thousands of job postings 

• Create job alerts based on your criteria 

• Get career advice from our Career Forum experts 

• Download career advice articles and webinars 

• Complete an individual development plan at “myIDP” 


Science Careers 


Target your job search using relevant resources 

on ScienceCareers.org. 


FROM THE JOURNAL SCIENCE AAAS 


Hunting for Talents 
NUAA, Jiangsu, China 

Nanjing University of Aeronautics and Astronautics (NUAA) is a research-oriented national key university of the “211 Project”. 
It also enjoys a well-balanced development of multiple disciplines in engineering, technology, natural sciences, economy, 
management and social sciences with the characteristics of aeronautics, astronautics and civil aviation. NUAA is qualified to 
be “Dominant Discipline Innovation Platform of 985 Project” and to independently recruit and receive international students 
who are granted the Chinese Government Scholarship. Now NUAA consists of 16 colleges with more than 3,000 staff 
members and approximately 26,000 degree students. 

Academia and education at NUAA represent strong capacity among all the universities in China. It has acquired national 
status through the quality of its excellent research work, especially in the areas of Aerospace Engineering, 
Mechanics, Electromechanics, Economy and Management, etc. 

NUAA gives a warm welcome to excellent experts, scholars and young students from both home and abroad, who are willing 
to serve the country, dedicate themselves to the development of aerospace science and make contributions to the 
industrialization and information technology of China. NUAA will provide teachers and researchers with a good academic 
environment, satisfactory working and living conditions and a stage on which they can put their talents to good use. 



Ms. Zhao Haiyan, Mr. Cao Yunxing 
Personnel Division, NUAA 

Address: 29# Yudao St. Nanjing, Jiangsu Province, Postcode: 210016 

Tel: +86-25-84892461 

Fax: +86-25-84895923 

Email: zhaohaiyan@nuaa.edu.cn 

Web: http://www.nuaa.edu.cn/nuaanew http://rsc.nuaa.edu.cn 





Faculty Positions available at 
Hohai University, Nanjing, China 


Hohai University invites applications for 
faculty positions at the assistant, associate, or 
full professor level in the area of engineering, 
science, economics, management, liberal arts, 
and law. Applicants should have a doctoral 
degree from a prestigious university. For the 
complete job announcements and directions on 
how to apply, visit: rsc.hhu.edu.cn or contact the 
Department of Human Resources at 
86-25-83786205. 


Hohai University, founded in 1915, has earned its 
worldwide reputation for research in Water 
Science, Civil Engineering, and Environmental 
Engineering. It is a national key university of 
China, and among the universities of the 
National “211 Project” and Innovation Bases of 
the National “985 Project”. Hohai University 
aims to be a research-oriented university. 






Southwest Jiaotong University, P.R.China 
Anticipates Your Working Application 


Southwest Jiaotong University (SWJTU), founded in 1896, situates itself in Chengdu, the provincial capital of Sichuan. 
It is a national key multidisciplinary “211” and “985 Feature” Projects university directly under the jurisdiction of the 
Ministry of Education, featuring engineering and a comprehensive range of study programs and research disciplines 
spreading across more than 20 faculties and institutes/centers. Boasting a complete Bachelor-Master-Doctor education 
system with more than 2,500 members of academic staff, our school also owns 2 first-level national key disciplines, 2 
supplementary first-level national key disciplines (in their establishment), 15 first-level doctoral programs, 43 first-level 
master programs, 75 key undergraduate programs, 10 post-doctoral stations and more than 40 key laboratories at national 
and provincial levels. 

Our university is currently implementing the strategy of “developing and strengthening the university by introducing and 
cultivating talents”. Therefore, we sincerely look forward to your working application. 

More information available at http://www.swjtu.edu.cn/ 


I. Positions and Requirements 


A. High-level Leading Talents 

It is required that candidates be listed in national top talents programs such as Program of Global Experts, Top Talents 
of National Special Support Program, “Chang Jiang Scholars”, China National Funds for Distinguished Young 
Scientists and National Award for Distinguished Teacher. 

Candidates are supposed to be no more than 50 years old. The limitation could be extended in the most-needed areas of 
disciplinary development. 

Candidates who work in high-level universities/institutes and reach the above requirements are supposed to be no more 
than 45 years old. 

B. Young Leading Scholars 

Candidates are supposed to be listed in or qualified to apply for the following programs: 

•National Thousand Young Talents Program 

•The Top Young Talents of National Special Support Program (Program for Supporting Top Young Talents) 

•Science Foundation for the Excellent Youth Scholars 

Candidates should have good team spirit and leadership, outstanding academic achievements, broad academic vision and 
international cooperation experience and have the potential of being a leading academic researcher. 

C. Excellent Young Academic Backbones 

Candidates under 40 years old are expected to graduate from high-level universities/institutes either in China or other 
countries. Those who are professors, associate professors and other equal talents from high-level universities/institutes 
overseas could be employed as professors and associate professors as well. 

D. Excellent Doctors and Post Doctoral Fellows 

Candidates under 35 years old are supposed to be excellent academic researchers from high-level universities either in 
China or other countries. 


II. Treatments 


The candidates will be provided with competitive salaries and welfares that include settling-in allowance, subsidy of 
rental residence, start-up funds of scientific research, assistance in establishing scientific platform and research group as 
well as international-level training and promotion. As for outstanding returnees, we can offer further or specific 
treatments that can be discussed personally. 



Contacts: Ye ZENG & Yinchuan LI Telephone number: 86-28-66366202 Email: talent@swjtu.edu.cn 

Address: Human Resources Department of SWJTU, the western park of high-tech zone, Chengdu, Sichuan, P.R.China, 
611756 


http://www.swjtu.edu.cn/ 







WORKING LIFE 


By Trisha Gura 


The art of entrepreneurship 


Robert S. Langer, the David H. Koch Institute Professor at the Massachusetts Institute of 
Technology (MIT) in Cambridge, is the most cited engineer in history, with more than 
163,000 citations. He holds more than 1000 patents, licensed or sublicensed to more than 
300 companies, and he has helped found at least two dozen biotechnology companies. This 
interview has been edited for brevity and clarity. 


Q: Why did you become a scientist? 

A: In my day, there was this 
company called A. C. Gilbert that 
made these great erector, chem- 
istry, and microscope sets. I got 
them for various birthdays. Later 
on, in high school, I was good in 
math and science and terrible in 
everything else. People would say 
to me, “You should become an 
engineer.” I thought engineers ran 
railroad cars. I went to Cornell and 
majored in engineering. My first 
term, the only class I liked and did 
OK in was chemistry, so I decided 
I would be a chemical engineer. 

When I graduated with my 
doctorate in 1974, the standard 
chemical engineering job was to 
work at an oil company. I wasn’t 
excited about that. So I kept look- 
ing for things I felt would make an impact, that would help 
people. I ended up working as a postdoc with Judah Folk- 
man, a surgeon at Boston Children’s Hospital. He had this 
idea that if you could stop blood vessels, maybe that would 
be a way to stop cancer. I thought, “Boy, this is a lot better 
than oil companies.” If it worked, it would be incredible. 

Q: How did you go from scientist to entrepreneur? 

A: My project was to isolate the first angiogenesis inhibi- 
tors. We invented microparticles that release an inhibitor 
of blood vessel growth to starve the growing tumors. I 
thought, “We’ll publish our work on microparticles, and 
everybody will use them clinically.” And then nobody did. 
Nobody used them to help people. So we patented them 
and licensed them. After about 10 years, somebody called 
me. They were going to work with the microspheres we 
developed, but they did only an experiment or two a 
year. I was frustrated. About a year later, Alex Klibanov, a 
professor at MIT, said, “Bob, we should start a company.” 

Q: What, in your view, is the most common mistake 
scientists make when trying to launch a company? 

A: Starting too early. It is a long road in the medical 
area. If you start up too early, you 
may have trouble getting invest- 
ment. Even if you succeed, then 
by the time you get to clinical 
trials, the investors may get tired. 


Q: What is the connection 
between graduate science educa- 
tion and entrepreneurship? 

A: When you’re a student, you’re 
judged by how well you answer 
questions. But in life, you’re 
judged by how good your ques- 
tions are. You want students 
and postdocs to transition from 
giving good answers to asking 
good questions. Then they’ll 
become great professors, great 
entrepreneurs, great something. 

When somebody is a student 
or postdoc, what is going to help 
them through is to be stretched. Feeling some of that 
discomfort, knowing how to get through it— the fact that 
you can prove to yourself that you can get through, and 
you can do well— that is wonderful, as long as it is not 
too painful. 

Q: How do you advise scientists to approach their careers? 

A: I always tell people, “Just follow your heart. Pick 
something you think you are going to love.” To me, that is 
the most important thing. 

Q: What is your take-home message to budding 
scientist-entrepreneurs? 

A: Do great science. Don’t sacrifice publishing good sci- 
ence to be secretive. Then go to the next step and patent 
it— and do licensing and start companies. That can be 
incredibly fulfilling because it gives you the opportunity to 
take your ideas— and give your students the opportunities 
to take their ideas along with you— and create things that 
could change the world and make it a better place. ■ 


Trisha Gura is a freelance writer who lives in Boston. For 
more on life and careers visit www.sciencecareers.org. 